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RESULT 1 
AAR84522 

ID AAR84522 standard; protein; 247 AA. 
XX 

AC AAR84522; 
XX 

DT 19-APR-1996 (first entry) 
XX 

DE Stanniocalcin from Corpuscles of Stannius . 
XX 

KW stanniocalcin; Corpuscles of Stannius; hypocalcaemia ; hypercalcaemia ; 

KW electrolyte disorder; osteoporosis; Paget 1 s disease; treatment. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 



FT 


Region 


1. .33 




FT 




/label= prepro region 




FT 


Protein 


34. .247 




FT 




/label= mature stanniocalcin 




XX 








PN 

C IN 


W09524411-A1 . 






XX 








PD 


14-SEP-1995 . 






yy 








PF 


09-MAY-1994; 


94WO-US005136. 




XX 








PR 


08-MAR-1994; 


94US-00208005. 




yy 








PA 


(HUMA-) HUMAN GENOME SCI INC. 




yy 

AA 








PT 


01 sen H, Adams 


MD; 




XX 








JJrs. 


WPI; 1995-328227/42. 




DR 


N-PSDB; AAT02438. 




AA 








PT 


Human corpuscles of Stannius polypeptide ( s ) - used to treat 




C 1 


hypercalcaemia, 


hypocalcaemia and other electrolyte disorders. 




VV 
AA 








DC 
IT O 


Claim 14; Fig 1 


; 41pp; English. 




yy 








cc 


Stanniocalcin, 


a Corpuscles of Stannius polypeptide is encoded by 




cc 


AAT02438. Stanniocalcin functions as a hypocalcaemic agent, and can 


be 


cc 


used for the treatment of e.g. electrolyte disorders which lead to 


renal, 


cc 


bone and heart 


diseases, hypertension, hypercalcaemia and disorders 


due 


cc 


to elevated bone resorption, e.g. osteoporosis and Paget 1 s disease 




XX 








SQ 


Sequence 247 AA; 





Query Match 100.0%; Score 1268; DB 2; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I hi I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
1 MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIT^NGVTSKVFLAIRRCSTFQRM 120 

121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 I AEVQEEC YS KLNVC S I AKRNP EAI TEWQLPNH FSNRYYNRLVRS LLECDEDTVST I RD 180 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

241 RTSHESA 247 

I I I I I I I 
241 RTSHESA 247 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



RESULT 2 
AAB23264 

ID AAB23264 standard; protein; 247 AA. 
XX 

AC AAB23264; 
XX 

DT 02-FEB-2001 (first entry) 
XX 

DE Human stanniocalcin . 
XX 

KW Human; stanniocalcin; STC; osteogenesis; bone disease; osteoporosis; 

KW mineral metabolism regulator; prophylaxis; therapy. 

XX 

OS Homo sapiens. 
XX 

PN JP2000229880-A. 
XX 

PD 22-AUG-2000. 
XX 

PF 10-FEB-1999; 99 JP-00033262 . 
XX 

PR 10-FEB-1999; 99 JP-00033262 . 
XX 

PA (SNOW ) SNOW BRAND MILK PROD CO LTD. 
XX 

DR WPI; 2000-605236/58. 
DR N-PSDB; AAA97594. 
XX 



PT An osteogenesis promotor useful in the prevention and/or treatment of 

PT bone diseases such as osteoporosis. 

XX 

PS Example 1; Page 5-6; 6pp; Japanese. 
XX 

CC The invention relates to a novel osteogenesis-promoting composition which 

CC contains stanniocalcin (STC) as the active component. Stanniocalcin is a 

CC possible regulator of mineral metabolism. The composition is useful as a 

CC prophylactic and/or therapeutic agent for bone diseases such as 

CC osteoporosis. The present sequence represents human stanniocalcin which 

CC was used in an exemplification of the invention 



XX 

SQ Sequence 247 AA; 

Query Match 100.0%; Score 1268; DB 3; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I > I M I I I I i II 



Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 



Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I I I 
Db 241 RTSHESA 247 



RESULT 3 
AAY55750 

ID AAY55750 standard; protein; 247 AA. 
XX 

AC AAY55750; 
XX 

DT ll-FEB-2000 (first entry) 
XX 

DE Human stanniocalcin polypeptide. 
XX 

KW Stanniocalcin-alpha polypeptide; teleocalcin; hypocalcin; hypercalcemic; 

KW anti-hypercalcemic glycoprotein hormone; corpuscles of stannius; PTH; 

KW parathyroid hormone; bone reabsorption; osteoporosis; gene therapy; bone; 

KW electrolyte disorder; renal; heart disease; osteopetrosis; human; 

KW Paget ! s disease; hypercalcemia. 

XX 

OS Homo sapiens. 
XX 

PN US5994103-A. 
XX 

PD 30-NOV-1999. 
XX 

PF 02-JUN-1995; 95US-00460529 . 
XX 

PR 10-NOV-1994; 94WO-US013206 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Fleischmann RD, Olsen HS; 
XX 

DR WPI; 2000-038260/03. 
XX 

PT Isolated nucleic acids encoding human stanniocalcin-alpha useful for 

PT treating electrolyte disorders which lead to renal, bone and heart 

PT diseases, osteoporosis and Paget 1 s disease. 
XX 

PS Disclosure; Fig 3; 21pp; English. 

XX 

CC The invention provides a human stanniocalcin-alpha polypeptide (also 

CC called teleocalcin and hypocalcin) . Stanniocalcin-alpha is an anti- 

CC hypercalcemic glycoprotein hormone produced by the corpuscles of 

CC stannius. It has a similar reported biological activity to that of 

CC parathyroid hormone (PTH) and both these proteins exhibit dual functions 

CC in mammals. They exert hypercalcemic activity due to stimulation of bone 

CC reabsorption. Further PTH has a biphasic action on bone metabolism (i.e. 

CC at low doses it increases bone formation and at high doses it increases 



CC bone reabsorption . Accordingly, human stanniocalcin-alpha and antagonists 

CC of it (under different circumstances) may be used to treat osteoporosis. 

CC The DNA may be used to produce human stanniocalcin-alpha according to 

CC standard recombinant DNA methodologies . The human stanniocalcin-alpha may 

CC be produced either in vitro in a fermentation culture or in vivo as part 

CC of a gene therapy protocol, and may be used to treat electrolyte 

CC disorders which lead to renal, bone and heart diseases. Due to the 

CC biphasic nature of stanniocalcin-alpha it may be used to treat 

CC osteoporosis, osteopetrosis and Paget 1 s disease. Alternatively, the 

CC polypeptides may be used as antigens in the production of antibodies to 

CC stanniocalcin-alpha and to assay for agonists and antagonists of its 

CC activity. The antibodies and antagonists may be used to inhibit the 

CC activity of stanniocalcin-alpha and may be used to treat osteoporosis and 

CC hypercalcemia. The present sequence represents a human s tanniocalcin 

XX 

SQ Sequence 247 AA; 

Query Match 100.0%; Score 1268; DB 3; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 1 MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 12 0 

M I I I I I I I II I I I I I I I I I I I I! I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I II I I I I I I I I I I M IMIII 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I I I 

Db 241 RTSHESA 247 



RESULT 4 
AAY57166 

ID AAY57166 standard; protein; 247 AA. 
XX 

AC AAY57166; 
XX 

DT ll-FEB-2000 (first entry) 
XX 

DE Human corpuscles of stanius polypeptide. 
XX 

KW Corpuscles of stanius polypeptide; calcium; inhibition; human; renal; 

KW therapeutic; bone; heart disease; hypocalcemia; osteoperosis . 

XX 

OS Homo sapiens . 
XX 



PN US5994301-A. 
XX 

PD 30-NOV-1999. 
XX 

PF 28-APR-1995; 95US-00431117 . 
XX 

PR 08-MAR-1994; 94US-00208005 . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 
XX 

PI Adams MD, Olsen HS; 
XX 

DR WPI; 2000-038269/03. 

DR N-PSDB; AAZ39520. 
XX 

PT Human corpuscles of stanius polypeptides used to inhibit calcium uptake. 
XX 

PS Claim 4; Fig 1A-B; 23pp; English. 
XX 

CC This represents a human corpuscles of stanius polypeptide, having a 

CC calcium uptake inhibitory activity. The cDNA is deposited under the 

•CC accession number ATCC Deposit No. 75652. The polypeptide can be used in a 

CC method for the treatment of a patient having need to inhibit uptake of 

CC calcium. The method is also used for the therapeutic treatment of renal, 

CC bone, and heart diseases, and the antagonist (may be an antibody) may be 

CC used for treating hypocalcemia., and osteoperosis 

XX 

SQ Sequence 247 AA; 

Query Match 100.0%; Score 1268; DB 3; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I 1 I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 
1 MLQN S AVL LVLVI S AS AT H EAEQN D S VS P RK S RVAAQN S AEVVRC LN S ALQVG CGAFAC L 60 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

61 ENSTCDTDGMYDICKSFLYS7VAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 12 0 

121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I M I I I I I I 

121 IAEVQEECYSKLNVCSI.AKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

241 RTSHESA 247 

I I I I I I I 
241 RTSHESA 247 



RESULT 5 
AAY92901 

ID AAY92901 standard; protein; 247 AA. 



QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



XX 

AC AAY92901; 
XX 

DT 26-SEP-2000 (first entry) 
XX 

DE Human s tanniocalcin protein. 
XX 

KW PCR primer; human; stanniocalcin; inhibitor; differentiation; maturation; 

KW adipocyte; obesity; diabetes; hypertension; heart disease. 

XX 

OS Homo sapiens. 
XX 

PN WO200016795-A1. 
XX 

PD 30-MAR-2000. 
XX 

PF 17-SEP-1999; 99WO- JP005080 . 
XX 

PR 17-SEP-1998; 98 JP-00263004 . 
XX 

PA (SNOW ) SNOW BRAND MILK PROD CO LTD. 
XX 

PI Goto M, Tomoyasu A, Yamaguchi K, Kinosaki M, Nakagawa N; 
XX 

DR WPI; 2000-283445/24. 
DR N-PSDB; AAA11145. 
XX 

PT Treating or preventing obesity, which is a risk factor for diabetes, 
PT hypertension and heart disease, comprises administering an agent 
PT containing stanniocalcin. 
XX 

PS . Example 1; Page 16; 19pp; Japanese. 
XX 

CC This sequence represents the human stanniocalcin protein. Stanniocalcin 

CC is an inhibitor of the differentiation and maturation of adipocytes. The 

CC protein is used for preventing and treating obesity which is a risk 

CC factor for diabetes, hypertension, and heart disease. The coding sequence 

CC was isolated from IMR-90 cells 

XX 

SQ Sequence 247 AA; 

Query Match 100.0%; Score 1268; DB 3; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MLQN S AVLLVLVI S AS AT HEAEQN D S VS P RK S RVAAQN S AE WRC LN S ALQ VGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 



Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 



Qy 241 RTSHESA 247 

I I I I I I I 

Db 241 RTSHESA 247 



RESULT 6 
AAB62473 

ID AAB62473 standard; protein; 247 AA. 
XX 

AC AAB62473; 
XX 

DT 09-JUL-2001 (first entry) 
XX 

DE Human stanniocalcin (STC) protein. 
XX 

KW Stanniocalcin; STC; neuroprotective; antiinflammatory; antianemic; 

KW antirheumatic; antiarthritic; dermatological ; antiallergic; human; 

KW nephrotropic; antithyroid; immunosuppressive; antidiabetic; vulnerary; 

KW antiasthmatic; hemostatic; antiarrhythmic; vasotropic; antipsoriatic; 

KW antitumor; anti HIV; antiparkinsonian; nootropic; cell proliferation; 

KW hemostatic; thrombolytic. 

XX 

OS Homo sapiens. 
XX 

PN WO200130969-A2 . 
XX 

PD 03-MAY-2001. 
XX 

PF 20-OCT-2000; 2000WO-US029432 . 
XX 

PR 27-OCT-1999; 99US-0161740P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 

PA (UYHE-) UNIV HELSINKI. 

XX 

PI Olsen HS, Zhang K, Lindsberg P, Tatlisumak T, Kaste M; 

PI Andersson LC; 

XX 

DR WPI; 2001-308626/32. 

DR N-PSDB; AAF83297. 
XX 

PT Novel composition useful for treating or protecting neural cells, for 

PT treating Addison's disease, organ rejection, hyperprolif erative disorder, 

PT cancer, AIDS, multiple sclerosis, comprises stanniocalcin polypeptide. 
XX 

PS Claim 1; Fig 1A-C; 253pp; English. 
XX 

CC The invention relates to a human stanniocalcin (STC) polypeptide. A 

CC pharmaceutical composition comprising the STC is useful for treating a 

CC patient in need of increased levels of STC activity. STC and its 

CC modulators are useful for treating disorders or abnormalities of nervous 

CC system, cerebrovascular diseases, dementia, encephalitis, central nervous 

CC system infections or neoplasms, demyelinating diseases, 



CC encephalomyelitis , spinal cord diseases , mental retardation such as 

CC Down's syndrome, Tay-Sachs disease, neuromuscular diseases such as 

CC muscular dystrophy, myasthenia gravis, deficiencies or disorders of 

CC immune system such as Addison's disease, hemolytic anemia, rheumatoid 

CC arthritis, dermatitis, glomerulonephritis, Goodpasture's Syndrome, 

CC Grave's disease, multiple sclerosis, autoimmune thyroiditis, systemic 

CC lupus erythematosus, insulin dependent diabetes mellitus, allergic 

CC reactions and conditions such as asthma, for treating and/or preventing 

CC organ rejection or graf t-versus-host disease, hyperprolif erative diseases 

CC such as purpura, Gaucher 's disease, cardiovascular disorders such as 

CC arrhythmias, telangiectasia, vasculitis, and for treatment of disease or 

CC disorders with neovascularization. The composition can be used to treat 

CC hemangioma, psoriasis, angiofibroma, atherosclerotic plaques, delayed 

CC wound healing, granulations, Osier-Weber syndrome, solid tumors such as 

CC Kaposi's sarcoma, cancer, AIDS, neurodegenerative disorders such as 

CC Alzheimer's disease, Parkinson's disease, wound healing, and for treating 

CC or detecting infectious agents. The present sequence represents the human 

CC STC polypeptide 

XX 

SQ Sequence 247 AA; 



Query Match 100.0%; Score 1268; DB 4; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 

Db 



1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRV7VAQNSAEVVRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 



Qy 

Db 

Qy 

Db 

Qy 

Db 



61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 



Qy 

Db 



241 RTSHESA 247 

I II I I II 
241 RTSHESA 247 



RESULT 7 
AAB62690 

ID AAB62690 standard; protein; 247 AA. 
XX 

AC AAB62690; 
XX 

DT 06-AUG-2001 (first entry) 
XX 

DE Lngl08, a diagnostic marker for cancer. 
XX 

KW Lngl08; cancer; diagnostic marker; cytotoxic; immune response; imaging. 



XX 

OS < Homo sapiens. 
XX 

PN WO200132209-A1. 
XX 

PD 10-MAY-2001. 
XX 

PF 03-NOV-2000; 2000WO-US030482 . 
XX 

PR 04-NOV-1999; 99US-0163444P . 
XX 

PA (DIAD-) DIADEXUS INC. 
XX 

PI Recipon H, Macina RA, Chen S, Sun Y; 
XX 

DR WPI; 2001-316386/33. 

DR N-PSDB; AAF83823. 
XX 

PT Novel assay for diagnosing and monitoring cancer, involves determining 

PT levels of Lngl08 in cells, tissues or bodily fluids of the patient, and 

PT comparing with control. 
XX 

PS Disclosure; Page 33-34; 36pp; English. 
XX 

CC The invention relates to diagnosing the presence of cancer or diagnosing 

CC metastases of cancer in a patient that involves determining levels of 

CC Lngl08 in a sample of cells, tissues or bodily fluids in a patient, and 

CC comparing the determined levels with levels of Lngl08 a normal human 

CC control. The method is useful for diagnosing the presence of cancer, 

CC diagnosing metastases of cancer, staging cancer, monitoring cancer, and 

CC monitoring a change in stage of the cancer, in a patient. A therapeutic 

CC agent which is an antibody labeled with paramagnetic ions or a 

CC radioisotope, and conjugated with a cytotoxic agent is useful for imaging 

CC cancer in a patient. A molecule which downregulates the expression or 

CC activity of Lngl08, is useful for treating cancer in a patient. Lngl08 

CC protein is useful for inducing an immune response against a target cell 

CC expressing Lngl08. The present sequence represents the human Lngl08 

CC polypeptide 

XX 

SQ Sequence 247 AA; 

Query Match 100.0%; Score 1268; DB 4; Length 247; 
Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 




Db 



1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 



60 



Qy 



61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 



120 



Db 



120 



QY 



121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 



180 




Db 



121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 



180 



Qy 

Db 



181 
181 



SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 24 0 

| | | | | | || | M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I M I I I I I I I i I I I 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 24 0 



Qy 241 RTSHESA 247 

I I I I I I I 
Db 241 RTSHESA 247 



RESULT 8 
ABB06259 

ID ABB06259 standard; protein; 247 AA. 
XX 

AC ABB06259; 
XX 

DT 23-MAY-2002 (first entry) 
XX 

DE Human staniocalcin 1 protein. 
XX 

KW Human; staniocalcin 1; osteopathic; osteogenesis failure; osteoporosis; 

KW bone mass reduction; traumatic bone injury; osteomalacia; bone disease; 

KW rheumatic bone disease; cancer associated bone disease; rachitis; 

KW arthritis deformans. 
XX 

OS Homo sapiens. 
XX 

PN WO200204013-A1. 
XX 

PD 17-JAN-2002. 
XX 

PF 10-JUL-2001; 2001WO- JP005962 . 
XX 

PR ll-JUL-2000; 2000JP-00209926. 
XX 

PA (BMLB-) BML INC. 
XX 

PI Yoshiko Y, Koide Y, Igarashi A, Takano S, Maeda N, Aubin JE; 
XX 

DR WPI; 2002-164600/21. 

DR N-PSDB; ABL40225. 
XX 

PT Agent used for treating bone diseases e.g. osteoporosis, traumatic bone 

PT injury, osteomalacia, rheumatic bone diseases, bone diseases associated 

PT with cancer and arthritis deformans containing staniocalcin I. 
XX 

PS Claim 2; Fig 1; 24pp; Japanese. 
XX 

CC The present invention describes an agent containing staniocalcin 1, 

CC particularly of human origin. Staniocalcin 1 has osteopathic activity. 

CC The agent can be used for treating diseases relating to osteogenesis 

CC failure or reduction in bone mass e.g. osteoporosis, traumatic bone 

CC injury, osteomalacia, rheumatic bone diseases, bone diseases associated 

CC with cancer, bone diseases due to phosphorus or calcium metabolic error, 

CC rachitis and arthritis deformans. The agent increases bone mass. The 

CC present sequence represents human staniocalcin 1 
XX 

SQ Sequence 247 AA; 



Query Match 100.0%; Score 1268; DB 5; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 

Db 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKS RVAAQNSAEVVRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 



Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

M I I I I I I M I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 



Qy 241 RTSHESA 247 

I I II II I 

Db 241 RTSHESA 247 



RESULT 9 






ABP97748 






ID 


ABP97748 standard; protein; 247 AA. 






XX 








AC 


ABP97748; 






XX 








DT 


28-MAY-2003 (first entry) 






XX 








DE 


Amino acid sequence of human STC1 polypeptide. 






XX 








KW 


Human; angiogenesis ; wound healing; retinopathy; 


ischemia; 


inflammation; 


KW 


microvasculopathy; bone healing; skin inflammation; HOG3; 


HOG8; HOG18; 


KW 


follicular development; CA9; HXB; IGFBP5; HFARP; 


STC1; mig 


-6; SSR4; 


KW 


cancer . 






XX 








OS 


Homo sapiens. 






XX 








PN 


WO2003010205-A1. 






XX 








PD 


06-FEB-2003. 






XX 








PF 


26-JUL-2002; 2002WO-US023786 . 






XX 








PR 


26-JUL-2001; 2001US-0307600P . 






PR 


24-JUL-2002; 2002US-00201642 . 






XX 








PA 


(UYDU-) UNIV DUKE MEDICAL CENT. 






XX 








PI 


Riggins GJ, Lai A; 






XX 








DR 


WPI; 2003-239423/23. 







DR N-PSDB; ABZ77288. 
XX 

PT Inhibiting angiogenesis for treating wound healing, retinopathy, 

PT ischemia, inflammation, microvasculopathy, bone healing, skin 

PT inflammation or follicular development by providing to a subject an 

PT antisense polynucleotide, 

XX 

PS Claim 4; Page 61; 66pp; English. 
XX 

CC The present sequence is a human STCl polypeptide. It is used in the 

CC method of the invention. The specification describes a method modulating 

CC angiogenesis associated with wound healing, retinopathy, ischemia, 

CC inflammation, microvasculopathy, bone healing, skin inflammation or 

CC follicular development. The method comprises providing to a subject H0G3, 

CC H0G8, H0G18, CA9, HXB, IGFBP5, HFARP, STCl, mig-6 or SSR4. The methods, 

CC antisense polynucleotides, polypeptides and antibodies are useful for 

CC treating wound healing, retinopathy, ischemia, inflammation, 

CC microvasculopathy, bone healing, skin inflammation or follicular 

CC development, or cancer such as breast, colon or lung cancer, or 

CC glioblastoma 

XX 

SQ Sequence 247 AA; 



Query Match 100.0%; Score 1268; DB 6; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYS7VAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 12 0 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

I I I I I I I I! I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I I I 

Db 241 RTSHESA 247 



RESULT 10 
ABR47600 

ID ABR47600 standard; protein; 247 AA. 
XX 

AC ABR47600; 
XX 

DT 12-JUN-2003 (first entry) 
XX 

DE Breast cancer associated protein sequence SEQ ID NO: 441. 



XX 

KW Human; breast cancer; cytostatic; gene therapy. 
XX 

OS Homo sapiens. 
XX 

PN WO2003004989-A2. 
XX 

PD 16-JAN-2003. 
XX 

PF 21-JUN-2002; 2002WO-US019669 . 
XX 

PR 21-JUN-2001; 2001US-0299887P . 

PR 27-JUN-2001; 2001US-0301572P . 

PR 18-JUL-2001; 2001US-0306501P . 

PR 25-SEP-2001; 2001US-0325002P . 

PR 05-MAR-2002; 2002US-0362585P . 

PR 14-MAY-2002; 2002US-0380391P . 
XX 

PA (MILL- ) MILLENIUM PHARM INC. 
XX 

PI Lillie J, Gannavarapu M, Glatt K, Hoersh S, Kamatkar S; 

PI Mertens M, Monahan JE, Myer V, Wang Y, Xu Y, Zhao X, Meyers RE; 

PI Bast RC, Hortobagyi GN, Pusztai L, Meric F, Sahin A, Mills GB; 

XX 

DR WPI; 2003-210381/20. 

DR N-PSDB; ACC50301. 
XX 

PT Breast cancer diagnosis or treatment by comparing the level of expression 

PT of a marker in a patient sample with that in the control non-breast 

PT cancer sample. 
XX 

PS Claim 1; SEQ ID NO 441; 128pp; English. 
XX 

CC The present invention describes a method for assessing whether a patient 

CC is afflicted with breast cancer . The method comprises comparing the level 

CC of expression of a marker (gene/polypeptide see ACC50076 to ACC50334 and 

CC ABR47386 to ABR47632) in a patient sample and the normal level of 

CC expression of the marker in a control non-breast cancer sample, where a 

CC significant increase in the level of expression of the marker in the 

CC patient sample and the normal level is an indication that the patient is 

CC afflicted with breast cancer. The breast cancer associated sequences from 

CC the present invention have cytostatic activities and can be used in gene 

CC therapy. The method is useful for diagnosing and treating breast cancer. 

CC N.B. The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at f tp . wipo . int/pub/published_pct_sequences 

XX 

SQ Sequence 247 AA; 

Query Match 100.0%; Score 1268; DB 6; Length 247; 
Best Local^ Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 

Db 



1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEVVRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1 MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 



Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I I I 

Db 241 RTSHESA 247 ' 



RESULT 11 


ABG72290 


ID 


Aob /zzyu stanaara, protein, / 


XX 




AC 




w 
AA 




r\<~n 
ui 


± 4 i v i/\k z u u 3 entry; 


VV 
AA 




uiL, 


numan Lorpuscies 01 otdnniub pj_0Ltrj.11. 


YV 
AA 




_\W 


n.U.iIlclIl / A/UipiibLlco O J_ O LdlllllUo piU UclU / t—cl _L L-_L Uill U^LaJVC XlJil±U± LUL / 


J\VV 


J_cllci_L cALIcLlUIl pilL»op__ci L.C , ci.cLLi.Uly L-tr UJ.OUIUC1 / -L cliai uiocaoc, 


I\VV 


Katid co3 cp • }n 3 r" "t~ rli cpa cp • ^r"f~^r"t^l Vi wp» p v--! - c^-n c: i pir-i « hnnp >~ pi c* P) y*r*i H on ' 
JJOI1 ti UlocdoC ^ lltr a. J_ L- uiocaod, a. J_ L. t: J_ x d _L iiypci lciid±uii/ uuiic icjui^liuh/ 


rvvv 


nc+"onrir\rnc; i c • not" ' q /-^nc;f=iac:c>» Vi t ry~\ p rpa 1 c* ^ P>m "I ^ * Vl \7T^i p rsl" n\7rm fj 1 <s rri * 
Ob LcUpuiUoib , JrciytiU o uibcaoc, iiypcL ua± LacilU. d , iiypcJ.yaLa uny LU-LUXoiit^ 


KW 


Vi \/-n^ rvi "h^mirmcti T) • t~i imnii t • spniin calcium 1 pvp 1 r Srirroi dosis ; 


KW 


hyperthyroidism; adrenal insufficiency; serum albumin; plasma protein; 


KW 


GI calcium absorption; stanniocalcin; osteopathic; cytostatic; 


KW 


hypotensive . 


XX 




OS 


Homo sapiens . 


XX 




PN 


US2002146791-A1. 


XX 




PD 


10-OCT-2002 . 


XX 




PF 


05-APR-2002; 2002US-00116051 . 


XX 




PR 


08-MAR-1994; 94US-00208 005 . 


PR 


28-APR-1995; 95US-00431117 . 


PR 


17-MAY-1999; 99US-00312610 . 


XX 




PA 


(HUMA-) HUMAN GENOME SCI INC. 


XX 




PI 


01 sen HS, Adams MD; 


XX 




DR 


WPI; 2003-147579/14. 


DR 


N-PSDB; ABX15152. 


XX 




PT 


Isolated polynucleotide for encoding polypeptides used to diagnose and 


PT 


treat electrolyte disorders leading to renal disease, e.g. Paget ' s 



disease, hypercalcemia and sarcoidosis. 
Claim 13; Fig 1; 2 6pp; English. 

The present invention relates to the isolation of human Corpuscles of 
Stannius protein, and the polynucleotide sequence encoding it. Human 
Corpuscles of Stannius protein inhibits calcium uptake and reduces renal 
excretion of phosphate. The polynucleotide sequence is useful for 
encoding Corpuscles of Stannius protein for therapeutic purposes, e.g. in 
the treatment of electrolyte disorders that lead to renal, bone or heart 
disease, such as arterial hypertension, disorders due to elevated bone 
resorption (e.g. osteoporosis and Paget ! s disease), and hypercalcaemia 
(e.g. hyperparathyroidism, hypervitaminosis D) , tumours that raise serum 
calcium levels by destroying bone, sarcoidosis, hyperthyroidism, adrenal 
insufficiency, loss of serum albumin secondary to renal diseases, or 
excessive GI calcium absorption and elevated concentration of plasma 
proteins. The encoded polypeptide exhibits a high degree of homology to 
Stanniocalcin from Anguilla australis and from Oncorhynchus kisutch. The 
present sequence represents human corpuscles of stannius protein 

Sequence 247 AA; 

Query Match 100.0%; Score 1268; DB 6; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKS RVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKS RVAAQNSAEWRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYS7UU<FDTQGKABVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I I I 
Db 241 RTSHESA 247 



PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



RESULT 12 
ADD48082 

ID ADD48082 standard; protein; 247 AA. 
XX 

AC ADD48082; 
XX 

DT 29-JAN-2004 (first entry) 
XX 

DE Human Protein P52823, SEQ ID NO 13779. 
XX 



KW Human; pain; neuronal tissue; gene therapy; 

KW spinal segmental nerve injury; chronic constriction injury; CCI; 

KW spared nerve injury; SNI; Chung. 

XX 

OS Homo sapiens. 
XX 

PN WO2003016475-A2. 
XX 

PD 27-FEB-2003. 
XX 

PF 14-AUG-2002; 2002WO-US025765 . 
XX 

PR 14-AUG-2001; 2001US-0312147P . 

PR 01-NOV-2001; 2001US-0346382P. 

PR 26-NOV-2001; 2001US-0333347P . 
XX 

PA (GEHO ) GEN HOSPITAL CORP. 

PA (FARB ) BAYER AG. 

XX 

PI Woolf C, D'urso D, Befort K, Costigan M; 
XX 

DR WPI; 2003-268312/26. 

DR GEN BANK; P52823. 
XX 

PT New composition comprising two or more isolated polypeptides, useful for 

PT preparing a medicament for treating pain in an animal. 

XX 

PS Claim 1; Page; 1017pp; English. 
XX 

CC The invention discloses a composition comprising two or more isolated rat 

CC or human polynucleotides or a polynucleotide which represents a fragment, 

CC derivative or allelic variation of the nucleic acid sequence. Also 

CC claimed are a vector comprising the novel polynucleotide, a host cell 

CC comprising the vector, a method for identifying a nucleotide sequence 

CC which is differentially regulated in an animal subjected to pain and a 

CC kit to perform the method, an array, a method for identifying an agent 

CC that increases or decreases the expression of the polynucleotide sequence 

CC that is differentially expressed in neuronal tissue of a first animal 

CC subjected to pain, a method for identifying a compound which regulates 

CC the expression of a polynucleotide sequence which is differentially 

CC expressed in an animal subjected to pain, a method for identifying a 

CC compound that regulates the activity of one or more of the 

CC polynucleotides, a method for producing a pharmaceutical composition, a 

CC method for identifying a compound or small molecule that regulates the 

CC activity in an animal of one or more of the polypeptides given in the 

CC specification, a method for identifying a compound useful in treating 

CC pain and a pharmaceutical composition comprising the one or more 

CC polypeptides or their antibodies. The polynucleotide or the compound that 

CC modulates its activity is useful for preparing a medicament for treating 

CC pain (e.g. spinal segmental nerve injury (Chung), chronic constriction 

CC injury (CCI) and spared nerve injury (SNI)) in an animal (e.g. gene 

CC therapy) . The sequence presented is a human protein (shown in Table 2 of 

CC the specification) which is differentially expressed during pain. Note: 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic form directly from WIPO at 

CC ftp . wipo . int/pub/published_pct_sequences . 



SQ Sequence 247 AA; 



Query Match 100.0%; Score 1268; DB 7; Length 247; 

Best Local Similarity 100.0%; Pred. No. 1.2e-123; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MLQN S AVL L VL VI S AS AT H EAEQN D S VS P RK S RVAAQN S AE WRC LN S ALQ VG C GAFAC L 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I 
Db 61 ENSTCDTDGMYDICKSFLYST^FDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I i I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 I AEVQEECYS KLNVCS IAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTI RD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I I I 
Db 241 RTSHESA 247 



RESULT 13 
ADD48080 

ID ADD48080 standard; protein; 247 AA. 

XX 

AC ADD48080; 
XX 

DT 29-JAN-2004 (first entry) 
XX 

DE Rat Protein P97574, SEQ ID NO 13777. 
XX 

KW Rat; pain; neuronal tissue; gene therapy; spinal segmental nerve injury; 

KW chronic constriction injury; CCI ; spared nerve injury; SNI; Chung. 

XX 

OS Rattus norvegicus. 
XX 

PN WO2003016475-A2. 
XX 

PD 27-FEB-2003. 
XX 

PF 14-AUG-2002; 2002WO-US025765 . 
XX . 

PR 14-AUG-2001; 2001US-0312147P . 

PR 01-NOV-2001; 2001US-0346382P . 

PR 26-NOV-2001; 2001US-0333347P . 
XX 

PA (GEHO ) GEN HOSPITAL CORP. 

PA (FARB ) BAYER AG. ■ 

XX 

PI Woolf C, D'urso D, Befort K, Costigan M; 
XX 



DR WPI; 2003-268312/26. 

DR GEN BANK; P97574. 
XX 

PT New composition comprising two or more isolated polypeptides, useful for 

PT preparing a medicament for treating pain in an animal. 

XX 

PS Claim 1; Page; 1017pp; English. 
XX 

CC The invention discloses a composition comprising two or more isolated rat 

CC or human polynucleotides or a polynucleotide which represents a fragment, 

CC derivative or allelic variation of the nucleic acid sequence. Also 

CC claimed are a vector comprising the novel polynucleotide, a host cell 

CC comprising the vector, a method for identifying a nucleotide sequence 

CC which is differentially regulated in an animal subjected to pain and a 

CC kit to perform the method, an array, a method for identifying an agent 

CC that increases or decreases the expression of the polynucleotide sequence 

CC that is differentially expressed in neuronal tissue of a first animal 

CC subjected to pain, a method for identifying a compound which regulates 

CC the expression of a polynucleotide sequence which is differentially 

CC expressed in an animal subjected to pain, a method for identifying a 

CC compound that regulates the activity of one or more of the 

CC polynucleotides, a method for producing a pharmaceutical composition, a 

CC method for identifying a compound or small molecule that regulates the 

CC activity in an animal of one or more of the polypeptides given in the 

CC specification, a method for identifying a compound useful in treating 

CC pain and a pharmaceutical composition comprising the one or more 

CC polypeptides or their antibodies. The polynucleotide or the compound that 

CC modulates its activity is useful for preparing a medicament for treating 

CC pain (e.g. spinal segmental nerve injury (Chung), chronic constriction 

CC injury (CCI) and spared nerve injury (SNI) ) in an animal (e.g. gene 

CC therapy) . The sequence presented is a rat protein (shown in Table 2 of 

CC the specification) which is differentially expressed during pain. Note: 

CC The sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic form directly from WIPO at 

CC ftp.wipo.int/pub/published_pct__sequences. 
XX 

SQ Sequence 247 AA; 

Query Match 97.1%; Score 1231; DB 7; Length 247; 

Best Local Similarity 96.0%; Pred. No. 8.6e-120; 

Matches 237; Conservative 6; Mismatches 4; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I : I I I I I I : I I I I I I I I I i I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MLQN S AVI LAL VI S AAAAH EAEQN D S VS P RK S RVAAQN S AE WRC LN S ALQ VGC GAFAC L 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I II I I I I I I I I I I 
Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGITSKVFLAI RRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I : I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 121 IAEVQEDCYSKLNVCSIAKRNPEAITEVIQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEGDSPSHIK 240 



Qy 241 RTSHESA 247 

III I : I 
Db 241 RTSQENA 247 



RESULT 14 
AAB56848 

ID AAB56848 standard; protein; 276 AA. 
XX 

AC AAB56848; 
XX 

DT 13-MAR-2001 (first entry) 
XX 

DE Human prostate cancer antigen protein sequence SEQ ID NO: 1426. 
XX 

KW Human; prostate cancer; prostate cancer antigen; detection; diagnosis; 

KW neuroprotective; cytostatic; cardioactive; immunomodulatory; muscular; 

KW vulnerary; gastrointestinal; nephrotropic; antiinf ective; gynaecological; 

KW antibacterial; gene therapy; neural; immune; reproductive; renal; 

KW gastrointestinal; pulmonary; cardiovascular; proliferative disorder; 

KW wound; infectious disease. 

XX 

OS Homo sapiens. 
XX 

PN WO200055174-A1. 
XX 

PD 21-SEP-2000. 
XX 

PF 08-MAR-2000; 2000WO-US005988 . 
XX 

PR 12-MAR-1999; 99US-0124270P . 
XX 

PA (HUMA-) HUMAN GENOME SCI INC. 

PA (ROSE/) ROSEN C A. 

XX 

PI Rosen CA, Ruben SM; 
XX 

DR WPI; 2000-587513/55. 

DR N-PSDB; AAF16051. 
XX 

PT Prostate cancer associated gene sequences, referred to as prostate cancer 

PT antigens, useful for treatment, prevention, and diagnosis of disorders 

PT such as prostate cancer. 
XX 

PS Claim 11; Page 1858-1859; 2338pp; English. 
XX 

CC AAF15566 to AAF16505 encode the human prostate cancer associated 

CC proteins, called prostate cancer antigens, given in AAB56363 to AAB57302. 

CC The prostate cancer antigens can have neuroprotective, cytostatic, 

CC cardioactive, immunomodulatory, muscular, vulnerary, gastrointestinal, 

CC nephrotropic, antiinf ective, gynaecological and antibacterial activities, 

CC and can be used in gene therapy. The prostate cancer antigen 

CC polynucleotides may be used for detection of prostate cancer, chromosome 

CC identification, as chromosome markers, and for numerous other diagnostic 

CC or research purposes. The prostate cancer antigens may be used to treat 

CC disorders such as neural, immune, muscular, reproductive, 



CC gastrointestinal, pulmonary, cardiovascular, renal, and proliferative 

CC disorders, wounds, and infectious diseases. AAF16506 to AAF16514 to 

CC AAB57303 represent sequences used in the exemplification of the present 

CC invention 

XX 

SQ Sequence 276 AA; 

Query Match 73.9%; Score 937; DB 3; Length 276; 

Best Local Similarity 99.5%; Pred. No. 5e-89; 
. Matches 185; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MLQN S AVLLVLVI SAS AT H EAEQND S VS P RKS RVAAQN SAEWRCLNS ALQVGCGAFACL 60 

I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 89 MLQN S AVLLVLVI SAS AT H EAEQNDS VS P RKS RVAAQN SAEWRC LNS ALQVGCGAFACL 148 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IN I I I I I I I 
Db 149 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 208 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 209 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 268 

Qy 181 SLMEKI 186 

I I I I I 

Db 269 SLMEXI 274 



RESULT 15 




AAP82968 




ID 


AAP82968 standard; protein; 261 AA. 


XX 






AC 


AAP82968; 




XX 






DT 


25-MAR-2003 


( revised) 


DT 


30-NOV-1990 


(first entry). 


XX 






DE 


Corpuscles of 


Stannius CS protein precursor. 


XX 






KW 


Corpuscles of 


Stannius; CS protein; cardiovascula. 


KW 


heart failure; high blood pressure. 


XX 






OS 


Anguilla australis. 


XX 






FH 


Key 


Location/Qualifiers 


FT 


Peptide 


1. .17 


FT 




/labels signal peptide/prohormone 


FT 


Peptide 


18. .288 


FT 




/label= pro-CS 


XX 






PN 


WO8803949-A. 




XX 






PD 


02-JUN-1988. 




XX 






PF 


19-NOV-1987; 


87WO-AU000389. 


XX 






PR 


21-NOV-1986; 


86AU-00009107. 



PR 31-DEC-1986; 87AU-00082797 . 

PR 22-MAY-1987; 87AU-0000208 6 . 
XX 

PA (FLOR-) FLOREY INST EXPER. 

PA (BUTK/) BUTKUS A. 

XX 

PI Butkus A, Coghlan JP, Roche PJ; 
XX 

DR WPI; 1988-161619/23. 

DR N-PSDB; AAN80655. 
XX 

PT CS protein of corpuscles of stannius - used for treating cardiovascular 

PT disease, renal disease and electrolyte disorders. 

XX 

PS Disclosure; Page ?; 38pp; English. 
XX 

CC This prepro-CS protein sequence has its signal or pro-hormone fragment 

CC (amino acids -17 to 1) cleaved to yield the pro-CS form which can be 

CC processed to yield the mature form. Recombinant DNA methods are utilised 

CC in the prodn. of the CS protein. It is useful as e.g. a therapeutic agent 

CC for the treatment of cardiovascular disease and oedema. See also 

CC AAN80654. (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 261 AA; 

Query Match 53.7%; Score 680.5; DB 1; Length 261; 

Best Local Similarity 61.4%; Pred. No. 2.8e-62; 

Matches 124; Conservative 41; Mismatches 36; Indels 1; Gaps 1; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I : I : : I I I : : I : I : : : : : I I I : I : I : : : I I I I I I I I I I I I I I I I 
Db 1 MLRMSGLILTLVL-VTAAYEQDESEPLSPRTARFSASSPSDVARCLNGALQVGCSAFACL 59 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

: I I I I : I I I I : : I I : I.I I : I I I I I I I I I I I I I I I I I I I I I : i I I I I I I I I I I : I I : I 
Db 60 DNSTCNTDGMHEICRSFLHGAAKFDTQGKTFVKESLKCIANGITSKVFLTIRRCSSFQKM 119 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I : I I I I I I I I I I : : I I : I : MM: II I : I : I Mil: I : : I I I I I I I I I : I 
Db 120 ISEVQEECYSKLDLCSVAQSNPEAMGEVAQVPSQFPNRYYSTLLQSLLTCDEDTVEQVRA 179 

Qy 181 SLMEKI GPNMAS LFHI LQTDHC 2 02 

I : : : I I I I : I I I I 
Db 180 GLVSRLEPEMGVLFQLLQTKAC 201 



Search completed: July 19, 2004, 15:32:16 
Job time : 57 sees 



GenCore version 5.1,6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: . July 19, 2004, 15:31:13 ; Search time 19 Seconds 

(without alignments) 
671.138 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-614-990-2 
1268 

1 MLQNSAVLLVLVI S AS AT HE . 



. NLRGEEDSPSHIKRTSHESA 247 



Scoring table: 



BLOSUM62 
Gapop 10.0 



Gapext 0 . 5 



Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 

1 : /cgn2_6/ptodata/2 /iaa/5A_COMB . pep : * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3: /cgn2_6/ptodata/2/iaa/6A__COMB.pep: * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 

5: /cgn2_6/ptqdata/2/iaa/PCTUS_COMB.pep:* 

6 : /cgn2_6/ptodata/2/iaa/backf ilesl . pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 








Description 


1 


. 1268 


100. 0 


247 


2 


us- 


08- 


208- 


005C-2 


Sequence 


2, Appli 


2 


1268 


100.0 


247 


2 


US- 


09- 


038- 


597A-2 


Sequence 


2, Appli 


3 


1268 


100.0 


247 


2 


us- 


08- 


460- 


529B-10 


Sequence 


10, Appl 


4 


1268 


100.0 


247 


2 


us- 


08- 


431- 


117A-2 


Sequence 


2, Appli 


5 


1268 


100. 0 


247 


4 


us- 


09- 


361- 


736B-10 


Sequence 


10, Appl 


6 


660.5 


52.1 


204 


2 


us- 


08- 


208- 


005C-5 


Sequence 


5, Appli 


7 


660.5 


52.1 


204 


2 


us- 


09- 


038- 


597A-5 


Sequence 


5, Appli 


8 


628 


49.5 


170 


2 


us- 


08- 


460- 


529B-9 


Sequence 


9, Appli 


9 


628 


49.5 


170 


4 


us- 


09- 


361- 


736B-9 


Sequence 


9, Appli 


10 


354.5 


28.0 


296 


3 


us- 


08- 


831- 


132-14 


Sequence 


14, Appl 


11 


354.5 


28.0 


296 


3 


us- 


09- 


416- 


150-14 


Sequence 


14, Appl 



12 


354 


27 . 


9 


302 


3 


us- 


08- 


831- 


132-2 


Sequence 


2, Appli 


13 


354 


27 . 


9 


302 


3 


us- 


09- 


416- 


150-2 


Sequence 


2, Appli 


14 


354 


27. 


9 


302 


4 


us- 


09- 


193- 


881-23 


Sequence 


23, Appl 


15 


354 


27 . 


9 


302 


4 


US- 


09- 


361- 


736B-12 


Sequence 


12, Appl 


16 


351.5 


27 . 


7 


251 


4 


us- 


09- 


361- 


736B-2 


Sequence 


2, Appli 


17 


333 . 5 


26. 


3 


251 


2 


us- 


08- 


460- 


529B-2 


Sequence 


2, Appli 


18 


89.5 


7 . 


1 


10182 


4 


us- 


09- 


134- 


001C-3159 


Sequence 


3159 f Ap 


19 


86 


6. 


8 


500 


4 


us- 


09- 


198- 


452A-299 


Sequence 


2 99, App 


20 


85.5 


6. 


7 


656 


4 


us- 


09- 


134- 


001C-4322 


Sequence 


4322, Ap 


21 


83 . 5 


6. 


6 


290 


4 


us- 


09- 


655- 


908-6 


Sequence 


6, Appli 


22 


83.5 


6. 


6 


290 


4 


us- 


09- 


655- 


908-8 


Sequence 


8, Appli 


23 


83 . 5 


6. 


6 


1027 


4 


us- 


09- 


762- 


724-8 


Sequence 


8, Appli 


24 


83 . 5 


6. 


6 


1029 


4 


us- 


09- 


762- 


724-6 


Sequence 


6, Appli 


25 


83 


6. 


5 


417 


4 


us- 


09- 


134- 


001C-3810 


Sequence 


3810, Ap 


26 


82 


6 . 


5 


362 


4 


us- 


09- 


134- 


001C-4670 


Sequence 


4 670, Ap 


27 


81 


6. 


4 


680 


4 


us- 


09- 


298- 


924-4 


Sequence 


4, Appli 


28 


81 


6. 


4 


720 


2 


us- 


08- 


840- 


236-1 


Sequence 


1, Appli 


29 


81 


6 . 


4 


720 


2 


us- 


08- 


505- 


44 8A-1 


Sequence 


1, Appli 


30 


79.5 


6 . 


3 


708 


1 


us- 


08- 


145- 


681-4 


Sequence 


4, Appli 


31 


79.5 


6 . 


3 


708 


1 


us- 


08- 


453- 


703-4 


Sequence 


4, Appli 


32 


79 . 5 


6 . 


3 


708 


2 


us- 


08- 


456- 


106-4 


Sequence 


4, Appli 


33 


79 . 5 


6. 


3 


708 


3 


us- 


08- 


456- 


108-4 


Sequence 


4, Appli 


34 


79 . 5 


6 . 


3 


708 


3 


us- 


09- 


265- 


577-4 


Sequence 


4, Appli 


35 


79.5 


6 . 


3 


708 


4 


us- 


09- 


633- 


739-4 


Sequence 


4, Appli 


36 


79 


6 . 


2 


264 


4 


us- 


09- 


134- 


000C-5831 


Sequence 


5831, Ap 


37 


78.5 


6 . 


2 


1288 


4 


us- 


09- 


546- 


934-4 


Sequence 


4, Appli 


38 


78 


6 . 


2 


315 


4 


us- 


09- 


•184- 


-964-4 


Sequence 


4, Appli 


3Q 


78 


6 . 


2 


338 


4 


us- 


09- 


-328- 


•352-4723 


Sequence 


4723, Ap 


40 


78 


6 . 


2 


383 


4 


us- 


■09- 


-328- 


•352-5969 


Sequence 


5969, Ap 


41 


78 


6. 


2 


781 


4 


us- 


•09- 


•738- 


•946-8 


Sequence 


8, Appli 


42 


77.5 


6. 


1 


467 


2 


us- 


-08^ 


•686- 


-599A-17 


Sequence 


17, Appl 


43 


76.5 


6. 


0 


315 


4 


us- 


■09- 


-615- 


-192A-378 


Sequence 


378, App 


44 


76 


6. 


0 


376 


4 


us- 


-09- 


-489- 


-039A-10020 


Sequence 


10020, A 


45 


76 


6. 


0 


2802 


3 


ys- 


-09- 


-542- 


-331-1 


Sequence 


1, Appli 



ALIGNMENTS 



RESULT 1 

US-08-208-005C-2 

; Sequence 2, Application US/08208005C 

; Patent No. 5837498 

; GENERAL INFORMATION: 

APPLICANT: OLSEN, ET AL . 

TITLE OF INVENTION: Corpuscles of Stannius Protein, Stanniocalcin 
NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
; ADDRESSEE: CECCHI, STEWART & OLSTEIN 

STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 
; STATE: NEW JERSEY 

COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 



COMPUTER: IBM PS/2 
; OPERATING SYSTEM: MS-DOS 

; SOFTWARE : WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/208 , 005C 
FILING DATE: 8 MARCH 1994 
CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
; NAME: FERRARO, GREGORY D. 

; REGISTRATION NUMBER: 36,134 

REFERENCE/ DOCKET NUMBER: 325800-78 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-994-1700 
; TELEFAX : 201-994-1744 

INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 24 7 AMINO ACIDS 
TYPE: AMINO ACID 
; STRANDEDNESS: 

TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-08-208-005C-2 

Query Match 100.0%; Score 1268; DB 2; Length 247; 

Best Local Similarity 100.0%; Pred. No. 2.3e-136; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 I I I 1 1 1 1 1 I I I 1 1 I I I I II I I ! I I 1 1 1 1 I I I I I 1 1 1 1 1 I I 1 1 

Db 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I 
Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 
Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I I I 

Db 241 RTSHESA 247 



RESULT 2 

US-09-038-597A-2 

; Sequence 2, Application US/09038597A 
; Patent No. 5877290 
; GENERAL INFORMATION: 

APPLICANT: OLSEN, ET AL. 



; TITLE OF INVENTION: Corpuscles of Stannius Protein, 
TITLE OF INVENTION: Stanniocalcin 
NUMBER OF SEQUENCES : 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARE L LA, BYRNE, BAIN, GILFILLAN, 
ADDRESSEE: CECCHI, STEWART & OLSTEIN 
; STREET: 6 BECKER FARM ROAD 

; CITY: ROSELAND 

; STATE: NEW JERSEY 

; COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
COMPUTER: IBM PS/2 
OPERATING SYSTEM: MS-DOS 
; SOFTWARE: WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/038 , 597A 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/208,005 
FILING DATE: 8-MARCH-1994 
ATTORNEY/AGENT INFORMATION: 
NAME: FERRARO, GREGORY D. 
REGISTRATION NUMBER: 36,134 
; REFERENCE/DOCKET NUMBER: 325800-78 

; TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-994-1700 
TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 247 AMINO ACIDS 
TYPE: AMINO ACID 
; STRANDEDNESS: 

TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-09-038-597A-2 

Query Match 100.0%; Score 1268; DB 2; Length 247; 

Best Local Similarity 100.0%; Pred. No. 2.3e-136; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I 
Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 



Qy 241 RTSHESA 247 

I 1 I I I I I 

Db 241 RTSHESA 2 47 



RESULT 3 

US-08-460-529B-10 

; Sequence 10, Application US/08460529B 
; Patent No. 5994103 
; GENERAL INFORMATION: 

APPLICANT: OLSEN, ET AL. 

TITLE OF INVENTION: Human ' Stanniocalcin-alpha 
NUMBER OF SEQUENCES : 10 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
; ADDRESSEE: CECCHI , STEWART & OLSTEIN 

; STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 

STATE: NEW JERSEY 

COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
; COMPUTER: IBM PS/2 

; OPERATING SYSTEM: MS-DOS 

SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/460, 529B 

FILING DATE: June 2, 1995 

CLASSIFICATION: 435 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US94 / 132 06 
; FILING DATE: 10 NOV 1994 

; ATTORNEY/AGENT INFORMATION: 

NAME: MULLINS, J.G. 

REGISTRATION NUMBER: 33,073 

REFERENCE/ DOCKET NUMBER: 325800-334 (PF143) 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 201-994-1700 

; TELEFAX: 201-994-1744 

; INFORMATION FOR SEQ ID NO: 10: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 247 AMINO ACIDS 
TYPE: AMINO ACID 
; STRANDEDNESS: 

TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-08-460-529B-10 

Query Match 100.0%; Score 1268; DB 2; Length 247; 

Best Local Similarity 100.0%; Pred. No. 2.3e-136; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I j 



Db 



1 



MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 



Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 



ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

RTSHESA 247 

I I I I I I I 

RTSHESA 247 



RESULT 4 

US-08-431-117A-2 

; Sequence 2, Application US/08431117A 
; Patent No. 5994301 
; GENERAL INFORMATION: 

APPLICANT: OLSEN, ET AL. 
; TITLE OF INVENTION: Corpuscles of Stannius Protein, Stanniocalcin 
; NUMBER OF SEQUENCES: 8 

; CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

ADDRESSEE: CECCHI, STEWART & OLSTEIN 

STREET: 6 BECKER FARM ROAD 
; CITY: ROSELAND 

STATE: NEW JERSEY 

COUNTRY: USA 
; ZIP: 07068 

COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 

COMPUTER: IBM PS/2 

OPERATING SYSTEM: MS-DOS 
; SOFTWARE: WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/431 , 117A 
; FILING DATE: 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/208,005 
; FILING DATE: 8 MARCH 1994 

; ATTORNEY/ AGENT INFORMATION: 

NAME: FERRARO, GREGORY D. 

REGISTRATION NUMBER: 36,134 
; REFERENCE/ DOCKET NUMBER: 3258 00-296 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 

TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: 2: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 247 AMINO ACIDS 



TYPE: AMINO ACID 
STRANDEDNESS: 
TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-08-431-117A-2 



Query Match 100.0%; Score 1268; DB 2; Length 247; 

Best Local Similarity 100.0%; Pred. No. 2.3e-136; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy - 


181 


Db 


181 


Qy 


241 


Db 


241 



MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

| | | | | | | I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 



60 



ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I | | | | | | M I I I I I I I I I I I I I I I I t I I I I I ! I I M I I I I M I I I I I I I I I I I I I II I I I 

ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 12 0 

IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 
| | | | | | | I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I II I I I I 



| | | | | | | | | | | | | | | | | I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 



MINI 



RESULT 5 

US-09-361-736B-10 

Sequence 10, Application US/09361736B 
Patent No. 6613877 
GENERAL INFORMATION: 
APPLICANT: Human Genome Sciences, Inc. 
TITLE OF INVENTION: Human Stanniocalcin-Alpha 
FILE REFERENCE: PF143P1D1 

CURRENT APPLICATION NUMBER: US/09/361, 736B 
CURRENT FILING DATE: 1999-07-28 
PRIOR APPLICATION NUMBER: 08/460,529 
PRIOR FILING DATE: 1995-06-02 . 
PRIOR APPLICATION NUMBER: PCT/ US94/13206 
PRIOR FILING DATE: 1994-11-30 
NUMBER OF SEQ ID NOS : 12 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 10 
LENGTH: 247 
TYPE: PRT 
■ ORGAN I SM : human 
US-09-361-736B-10 

Query Match 100.0%; Score 1268; DB 4; Length 247; 

Best Local Similarity 100.0%; Pred. No. 2.3e-136; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 



1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSA£WRCLNSALQVGCGAFACL 60 

| | | | | | | I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I 

1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFIiAIRRCSTFQRM 120 

121 I7VEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

241 RTSHESA 247 

I I II I I I 
241 RTSHESA 247 



RESULT 6 

US-08-208-005C-5 

; Sequence 5, Application US/08208005C 
; Patent No. 5837498 

GENERAL INFORMATION: 
; APPLICANT: OLSEN, ET AL . 

TITLE OF INVENTION: Corpuscles of Stannius Protein, Stanniocalcin 
NUMBER OF SEQUENCES: 5 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
ADDRESSEE: CECCHI, STEWART & OLSTEIN 
STREET: 6 BECKER FARM ROAD 
CITY: ROSELAND 
; STATE: NEW JERSEY 

; COUNTRY: USA 

ZIP: 07068 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: 3.5 INCH DISKETTE 

COMPUTER: IBM PS/2 
; OPERATING SYSTEM: MS-DOS 

, SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/208 , 005C 

FILING DATE: 8 MARCH 1994 
CLASSIFICATION: .435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 
; FILING DATE: 

ATTORNEY/AGENT INFORMATION: 
; NAME: FERRARO, GREGORY D-. 

; REGISTRATION NUMBER: 36,134 

REFERENCE/ DOCKET NUMBER: 325800-78 
; TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-994-1700 
TELEFAX: 201-994-1744 
; INFORMATION FOR SEQ ID NO: 5: 



QY 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



SEQUENCE CHARACTERISTICS: 

LENGTH: 204 AMINO ACIDS 

TYPE: AMINO ACID 

STRANDEDNESS: 

TOPOLOGY: LINEAR 
; MOLECULE TYPE: PROTEIN 
US-08-208-005C-5 

Query Match 52.1%; Score 660.5; DB 2; Length 204; 

Best Local Similarity 60.8%; Pred. No. 3.7e-67; 

Matches 118; Conservative 38; Mismatches 37; Indels 1; Gaps 1; 
Qy 11 LVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGM 70 



Db 12 LVLGTAATFDTDPEEA-SPRRARFSSNSPSDVARCLNGALAVGCGTFACLENSTCDTDGM 7 0 

Qy 71 YDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRMIAEVQEECYS 130 

: I I I : I : : I I I : I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I I 
Db 71 HDICQLFFHTAATFNTQGKT FVKESLRCIANGVTSKVFQTIRRCGVFQRMISEVQEECYS 130 

Qy 131 KLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNM 190 

: I : : I : i : I I I I I I I I I : I II MM: I : : I I I I I I : I I : : I I : ~ : I I : I 
Db 131 RL D I C GVARS N P EAI GE WQVPAH FPNRY Y S T L LQ S L LAC DE ET VAWRAGL VARL G P DM 190 

Qy 191 ASLFHILQTDHCAQ 204 

: I I : I I I I I 
Db 191 ETLFQLLQNKHCPQ 2 04 



RESULT 7 

US-09-038-597A-5 

; Sequence 5, Application US/09038597A 
; Patent No. 5877290 
; GENERAL INFORMATION: 

APPLICANT: OLSEN, ET AL . 
; TITLE OF INVENTION: Corpuscles of Stannius Protein, 
; TITLE OF INVENTION: Stanniocalcin 
; NUMBER OF SEQUENCES: 5 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 

ADDRESSEE: CECCHI , STEWART & OLSTEIN 

STREET: 6 BECKER FARM ROAD 

CITY: ROSELAND 

STATE: NEW JERSEY 
; COUNTRY: USA 

ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 

COMPUTER: IBM PS/2 

OPERATING SYSTEM: MS-DOS 
; SOFTWARE: WORD PERFECT 5.1 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/038, 597A 

FILING DATE: 

CLASSIFICATION: 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/208,005 



FILING DATE: 8-MARCH-l 994 
ATTORNEY/ AGENT INFORMATION: 
NAME: FERRARO, GREGORY D. 
REGISTRATION NUMBER: 36,134 
REFERENCE/ DOCKET NUMBER: 325800-78 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 2 01-994-1700 
TELEFAX : 201-994-1744 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 204 AMINO ACIDS 
TYPE: AMINO ACID 
STRANDEDNESS: 
TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-09-038-597A-5 

Query Match 52.1%; Score 660.5; DB 2; Length 204; 

Best Local Similarity 60.8%; Pred. No. 3.7e-67; 

Matches 118; Conservative 38; Mismatches 37; Indels 1; Gaps 1; 

Qy 11 LVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGM 70 

M: : I I : : : : II I : : I : : : : : I I I I I II I I I I I I I I I I I I I I I I I I 
Db 12 LVLGTAATFDTDPEEA-SPRRARFSSNSPSDVARCLNGALAVGCGTFACLENSTCDTDGM 70 

Qy 71 YDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRMIAEVQEECYS 130 

: I I I : I :: I I Mill I I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I I 
Db 71 HDICQLFFHTAATFNTQGKTFVKESLRCIANGVTSKVFQTIRRCGVFQRMISEVQEECYS 130 

Qy 131 KLNVC S I AKRNPEAI T EWQLPNHFSNRYYNRLVRS LLECDEDTVS T I RDS LMEKI GPNM 190 

: I : : I : I : I I I I I I I I I : I II MM: I : : I I I I I I : I I : : I I : : : I I : I 
Db 131 RLDICGVARSNPEAIGEWQVPAHFPNRYYSTLLQSLLACDEETVAWRAGLVARLGPDM 190 

Qy 191 ASLFHILQTDHCAQ 204 

: I I : I I III 
Db 191 ETLFQLLQNKHCPQ 2 04 



RESULT 8 

US-08-460-529B-9 

Sequence 9, Application US/08460529B 
Patent No. 5994103 
GENERAL INFORMATION: 

APPLICANT: OLSEN, ET AL. 

TITLE OF INVENTION: Human Stanniocalcin-alpha 
NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
ADDRESSEE: CECCHI, STEWART & OLSTEIN 
STREET: 6 BECKER FARM ROAD 
CITY: ROSELAND 
STATE: NEW JERSEY 
COUNTRY: USA 
ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
COMPUTER: IBM PS/2 



OPERATING SYSTEM: MS-DOS 
SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/460, 529B 
FILING DATE: June 2, 1995 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/US94/13206 
FILING DATE: 10 NOV 1994 
ATTORNEY/AGENT INFORMATION: 
NAME: MULLINS, J.G. 
REGISTRATION NUMBER: 33,073 

REFERENCE/ DOCKET NUMBER: 325800-334' (PF143) 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-994-1700 
TELEFAX: 201-994-1744 
INFORMATION FOR SEQ ID NO: 9: 
SEQUENCE CHARACTERISTICS : 
LENGTH: 170 AMINO ACIDS 
TYPE: AMINO ACID 
STRANDEDNESS: 
TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-08-460-529B-9 

Query Match 4 9.5%; Score 628; DB 2; Length 170; 

Best Local Similarity 66.5%; Pred. No. 1.4e-63; 

Matches 113; Conservative 30; Mismatches 27; Indels 0; Gaps 0; 

Qy 28 SPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKFDTQ 87 

III : I : I : : : I I I I I I I I I I I I I I I I : I I I I : I I I I : : I I : I I I : I I I I I I I 

Db 1 SPRTARFSASSPSDVARCLNGALQVGCSAFACLDNSTCNTDGMHEICRSFLHGAAKFDTQ 60 

Qy 88 GKAFVKESLKCIANGVTSKVFLAIRRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITE 147 

II I I I I I I I I I I I I : I I I I II I I I I I : I I : I I : I I I I I I I I I I :: I I : I : MM: I 

Db 61 GKTFVKESLKCIANGITSKVFLTIRRCSSFQKMISEVQEECYSKLDLCSVAQSNPEAMGE 12 0 

Qy 148 WQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHIL 197 

I I : I : I I I II : I :: I I I I I I I I I : I I : : : I I I I : I 
Db 121 VAQVPSQFPNRYYSTLLQSLLTCDEDTVEQVRAGLVSRLEPEMGVLFQLL 17 0 



RESULT 9 

US-09-361-736B-9 

Sequence 9, Application US/09361736B 
Patent No. 6613877 
GENERAL INFORMATION: 
APPLICANT: Human Genome Sciences, Inc. 
TITLE OF INVENTION: Human Stanniocalcin-Alpha 
FILE REFERENCE: PF143P1D1 

CURRENT APPLICATION NUMBER: US/ 09/361 , 736B 
CURRENT FILING DATE: 1999-07-28 
PRIOR APPLICATION NUMBER: 08/460,529 
PRIOR FILING DATE: 1995-06-02 
PRIOR APPLICATION NUMBER: PCT/ US94/13206 
PRIOR FILING DATE: 1994-11-30 
NUMBER OF SEQ ID NOS : 12 



; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 9 

LENGTH: 170 
; TYPE: PRT 

; ORGANISM: Anguilla australis 
US-09-361-736B-9 



Query Match 49.5%; Score 628; DB 4; Length 170; 

Best Local Similarity 66.5%; Pred. No. 1.4e-63; 

Matches 113; Conservative 30; Mismatches 27; Indels 0; Gaps 0; 

Qy 28 SPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKFDTQ 87 

III : I : I : : : I I I I I I I I I I I I I I I I : I I I I : I I I I : : I I : I I I : I I II II I 
Db 1 SPRTARFSASSPSDVARCLNGALQVGCSAFACLDNSTCNTDGMHEICRSFLHGAAKFDTQ 60 

Qy 88 GI^FVKESLKCIANGVTSKVFLAIRRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITE 147 

II I I I I I I I I I I I I : I II II I I I I I I : I I : I I : I I I I II I I I I :: I I : I : MM: I 
Db 61 GKTFVKESLKCIANGITSKVFLTIRRCSSFQKMISEVQEECYSKLDLCSVAQSNPEAMGE 120 

Qy 148 WQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHIL 197 

I I : I : I I I I I : I :: I I I I I I I I I : I I : : : I I I I : I 
Db 121 VAQVPSQFPNRYYSTLLQSLLTCDEDTVEQVRAGLVSRLEPEMGVLFQLL 170 



RESULT 10 
US-08-831-132-14 

Sequence 14, Application US/08831132 
Patent No. 6008322 
GENERAL INFORMATION: 

APPLICANT: Kuestner, Rolf E. 
APPLICANT: Conklin, Darrell C. 
APPLICANT: Lok, Si 
APPLICANT: Buddie, Michele 
APPLICANT: Downey, William 
TITLE OF INVENTION: STANNIOCALCIN-2 
NUMBER OF SEQUENCES: 22 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: ZymoGenetics , Inc. 
STREET: 1201 Eastlake Avenue East 
CITY: Seattle 
STATE: WA 
COUNTRY: USA 
ZIP : 98102 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/831,132 
FILING DATE: 
CLASSIFICATION: 424 
ATTORNEY/AGENT INFORMATION: 
NAME: Sawislak, Deborah A. 
REGISTRATION NUMBER: 37,438 
REFERENCE/ DOCKET NUMBER: 96-01 
TELECOMMUNICATION INFORMATION: 



TELEPHONE: 206-442-6672 
TELEFAX: 206-442-6678 
INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 296 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-831-132-14 

Query Match 28.0%; Score 354.5; DB 3; Length 296; 

Best Local Similarity 31.7%; Pred. No. 5.4e-32; 

Matches 85; Conservative 45; Mismatches 109; Indels 29; Gaps 5; 

Qy 7 VLLVLVISASATHEAEQNDSVSP RKS RVAAQNSAEWRCLNS ALQVGCGA 56 

| I I I : | : II : I : I I : : I I : I I : II : I I I I I 

Db 10 VTLALVF— ATLDPAQGTDSTNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGV 67 

Qy 57 FACLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCST 116 

I I II:: |: I : : I I : I h :.l I I.I I I I : h h : h I I : : ' I hi 
Db 68 FECFENNSCEIQGLHGICMTFLHNAGKFDAQGKSFIKDALRCKAHALRHKFGCISRKCPA 127 

Qy - H7 FQRMIAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVS 176 

: | : : : I I I I I : : I I I : I I h : : : I II I I I I I 

Db 128 IREMVFQLQRECYLKHDLCSAAQENVGVIVEMIHFKDLLLHEPYVDLVNLLLTCGEDVKE 187 

Qy 177 TIRDSLMEKIGPNMASLFHIL — QTDHCAQTH PRADFNRRRTNEPQKLKV 224 

: I: : : I II : II I I" : I = 

Db 188 AVTRSVQAQCEQSWGGLCSILSFCTSNIQRPPTAAPEHQPLADRAQLSRPHHRDTDHHLT 247 

Qy 225 LLRNLRGEEDSPSHIK RTSHESA 247 

I : I I I I I I I : I I 

Db 24 8 ANRGAKGERGSKSHPNAHARGRTGGQSA 275 



RESULT 11 
US-09-416-150-14 

Sequence 14, Application US/09416150 
Patent No. 6171822 

GENERAL INFORMATION: 

APPLICANT: Kuestner, Rolf E. 

Conklin, Darrell C. 
Lok, Si 

Buddie, Michele 
Downey, William 
TITLE OF INVENTION: STANNIOCALCIN-2 
NUMBER OF SEQUENCES: 22 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: ZymoGenetics , Inc. 
STREET: 1201 Eastlake Avenue East 
CITY: Seattle 
STATE: WA 
COUNTRY: USA 
ZIP: 98102 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 



; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/416, 150 
FILING DATE: ll-Oct-1999 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/831,132 
FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 

NAME: Sawislak, Deborah A. 
REGISTRATION NUMBER: 37,438 
; REFERENCE/ DOCKET NUMBER: 96-01 

TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 206-442-6672 

TELEFAX: 206-442-6678 
; INFORMATION FOR SEQ ID NO: 14: 
SEQUENCE CHARACTERISTICS: 
; ' LENGTH: 296 amino acids 

; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
US-09-416-150-14 

Query Match 28.0%; Score 354.5; DB 3; Length 296; 

Best Local Similarity 31.7%; Pred. No. 5.4e-32; 

Matches 85; Conservative 45; Mismatches 109; Indels 29; Gaps 5, 

7 VLLVLVI SASATHEAEQNDSVSP RKSRVAAQNSAEWRCLNSALQVGCGA 56 

| | || : |: || :| : I I : : I I : II : I I : I I I I I 

10 VTL7^VF--ATLDPAQGTDSTNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGV 67 

57 FACLENSTCDTDGMYDICKSFLYSA7VKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCST 116 

| i | | | : : | : I : : I I : II : : I I I I I I I : I : I : : I : I I : : I hi 
68 FECFENNSCEIQGLHGICMTFLHNAGKFDAQGKSFIKDALRCKARALRHKFGCISRKCPA 127 

117 FQRMIAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVS 17 6 

: |: ::| III I ::|| h I I h: : . : I I I Mill 
128 IREMVFQLQRECYLKHDLCSAAQENVGVIVEMIHFKDLLLHEPYVDLVNLLLTCGEDVKE 187 

177 TIRDSLMEKIGPNMASLFHIL — QTDHCAQTH PRADFNRRRTNEPQKLKV 224 

: |: : : I II : II MM : 

188 AVTRSVQAQCEQSWGGLCSILSFCTSNIQRPPTAAPEHQPIADRAQLSRPHHRDTDHHLT 247 

225 LLRNLRGEEDSPSHIK RTSHESA 247 

I : I I I I I I I M I 

248 ANRGAKGERGS KSHPNAHARGRTGGQSA 275 



RESULT 12 
US-08-831-132-2 

; Sequence 2, Application US/08831132 

; Patent No. 6008322 

; GENERAL INFORMATION: 

; APPLICANT: Kuestner, Rolf E. 

APPLICANT: Conklin, Darrell C. 



QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



APPLICANT: Lok, Si 
APPLICANT: Buddie, Michele 
APPLICANT: Downey, William 
TITLE OF INVENTION: STANNIOCALCIN-2 
NUMBER OF SEQUENCES: 22 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: ZymoGenetics , Inc. 
STREET: 1201 Eastlake Avenue East 
CITY: Seattle 
STATE: WA 
COUNTRY: USA 
ZIP: 98102 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE : Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/831, 132 
FILING DATE: 
CLASSIFICATION: 424 
ATTORNEY/AGENT INFORMATION : 
NAME: Sawislak, Deborah A. 
REGISTRATION NUMBER: 37,438 
REFERENCE/ DOCKET NUMBER: 96-01 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 206-442-6672 
TELEFAX: 206-442-6678 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 302 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-831-132-2 

Query Match 27.9%; Score 354; DB 3; Length 302; 

Best Local Similarity 32.4%; Pred. No. 6.4e-32; 

Matches 83; Conservative 45; Mismatches 102; Indels 26; Gaps 4; 

LVLVIS AS AT H EAE - QN D S VS P RK S RVAAQN S AEWRCLN S ALQVG C GAFAC L 60 

|||:: II: I i I :| I:: I I : I I : II : I MM I I 



I I : : I : I : : I I : II : : I M I M h I : I : : M I I : : : hi : I 

ENNSCEIRGLHGICMTFLHNAGKFDAQGKSFIKDALKCKAHALRHRFGCISRKCPAIREM 131 

IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

:: : : I I M I : : h h I I h : : : I I I M I h I 

VSQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHEPYVDLVNLLLTCGEEVKEAITH 191 

SLMEKIGPNMASLFHILQ-TDHCAQTHPRADFNRRRTNEPQKL — 222 

I : : I II II III h : M 



Qy 


9 


Db 


12 


Qy 


61 


Db 


72 


Qy 


121 


Db 


132 


Qy 


181 


Db 


192 



Qy 



223 KVLLRNLRGEEDSPSH 238 



Db 



: I : I I III 

252 RETGRGAKGERGSKSH 267 



RESULT 13 
US-09-416-150-2 

; Sequence 2, Application US/09416150 
; Patent No. 6171822 

GENERAL INFORMATION: 

APPLICANT: Kuestner, Rolf E. 
; Conklin, Darrell C. 

; Lok, Si 

; Buddie , Michele 

; Downey, William 

TITLE OF INVENTION: STANNIOCALCIN-2 
NUMBER OF SEQUENCES: 22 
; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: Z ymoGenetics , Inc. 

; STREET: 1201 Eastlake Avenue East 

CITY: Seattle 
STATE: WA 
COUNTRY: USA 
; ZIP: 98102 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.25 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/09/416,150 

FILING DATE: ll-Oct-1999 
; CLASSIFICATION: <Unknown> 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/831,132 
FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
; NAME : Sawislak, Deborah A. 

; REGISTRATION NUMBER: 37,438 

; REFERENCE/ DOCKET NUMBER: 96-01 

TELECOMMUNICATION INFORMATION: 
TELEPHONE: 206-442-6672 
TELEFAX: 206-442-6678 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 302 amino acids 
; TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
; SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

US-09-416-150-2 

Query Match 27.9%; Score 354; DB 3; Length 302; 

Best Local Similarity 32.4%; Pred. No. 6.4e-32; 

Matches 83; Conservative 45; Mismatches 102; Indels 26; Gaps 4; 

Qy 9 LVLVIS AS AT H EAE - QN D S VS P RK S RVAAQN S AE WRCLN S ALQVG C GAFAC L ■ 60 

Ml:: ||: I I I : I I : : I I : I I : I I : I I I I I I I 



Db 



12 LALVLATFDPARGTDATNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECF 71 



Qy 



Db 



61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGWSKVFLAI RRCSTFQRM 120 

I I : : I : I : : I I : I I : : I I I I I I I : I : I : : I I I I : : : I : I : I 

72 ENNSCEIRGLHGICMTFLHNAGKFDAQGKSFIKDALKCKAHALRHRFGCISRKCPAI REM 131 



Qy 



121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 



Db 



132 VSQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHEPYVDLVNLLLTCGEEVKEAITH 191 



Qy 




Db 



192 SVQVQCEQNWGSLCSILSFCTSAIQKPPTAPPERQPQVDRTKLSRAHHGEAGHHLPEPSS 251 



Qy 



223 KVLLRNLRGEEDSPSH 238 



Db 



252 RETGRGAKGERGSKSH 267 



RESULT 14 
US-09-193-881-23 

; Sequence 23, Application US/09193881A 

; Patent No. 6538119 

; GENERAL INFORMATION: 

; APPLICANT: Patricia Billing-Medel 

; APPLICANT: Maurice Cohen 

; APPLICANT: Tracey L. Colpitts 

APPLICANT: Paula N. Friedman 
; APPLICANT: Edward N. Granados 
; APPLICANT: Michael R. Klass 
; APPLICANT: John C. Russell 

APPLICANT: Stephen D. Stroupe 
; TITLE OF INVENTION: Reagents and Methods Useful for Detecting Diseases of the 
; TITLE OF INVENTION: Breast 

FILE REFERENCE: 6248. US. Pi 
; CURRENT APPLICATION NUMBER: US/09/193, 881A 
; CURRENT FILING DATE: 1998-11-18 
; NUMBER OF SEQ ID NOS: 29 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 23 

LENGTH: 302 

TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-193-881-23 

Query Match 27.9%; Score 354; DB 4; Length 302; 

Best Local Similarity 32.4%; Pred. No. 6.4e-32; 

Matches 83; Conservative 45; Mismatches 102; Indels 26; Gaps 4; 

Qy 9 LVLVIS AS AT HEAE - QN D S VS P RK S RVAAQN S AE WRC LN S ALQVGC GAFAC L 60 

III:: II: I I I : I I : : I I : I I : II : I I I I I I I 

Db 12 LALVLATFDPARGTDATNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECF 71 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I : : I : I : : I I : I I : : I I I I I i I : I : I : : I I I I : : : hi : I 

Db 72 ENNSCEI RGLHGI CMT FLHNAGKFDAQGKS FI KDALKCKAHALRHRFGCI SRKCPAI REM 131 



Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

: : : : I I I I I : : I : I : I I I : : : : I I I I I I I : I 
Db 132 VSQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHEPYVDLVNLLLTCGEEVKEAITH 191 

Qy 181 SLMEKIGPNMASLFHILQ-TDHCAQTHPRADFNRRRTNEPQKL 222 

I :: I II I I I I I I :: I I 

Db 192 SVQVQCEQNWGSLCSILSFCTSAIQKPPTAPPERQPQVDRTKLSRAHHGEAGHHLPEPSS 251 

Qy 223 KVLLRNLRGEEDSPSH 238 

: I : I I I I I 
Db 252 RETGRGAKGERGSKSH 267 



RESULT 15 
US-09-361-736B-12 

; Sequence 12, Application US/09361736B 
; Patent No. 6613877 
; GENERAL INFORMATION: 

; APPLICANT: Human Genome Sciences , Inc. 

; TITLE OF INVENTION: Human Stanniocalcin-Alpha 

; FILE REFERENCE: PF143P1D1 

; CURRENT APPLICATION NUMBER: US/09/ 3 61 , 736B 

CURRENT FILING DATE: 1999-07-28 
; PRIOR APPLICATION NUMBER: 08/460,529 
; PRIOR FILING DATE: 1995-06-02 
; PRIOR APPLICATION NUMBER: PCT/ US94/13206 

PRIOR FILING DATE: 1994-11-30 
; NUMBER OF SEQ ID NOS : 12 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 12 
; LENGTH: 302 
TYPE: PRT 
ORGANISM: human 
US-09-361-736B-12 

Query Match 27.9%; Score 354; DB 4; Length 302; 

Best Local Similarity 32.4%; Pred. No. 6.4e-32; 

Matches 83; Conservative 45; Mismatches 102; Indels 26; Gaps 4; 

Qy 9 LVLVIS AS ATH EAE- QN D S VS P RKS RVAAQN S AE WRCLN S ALQVGC GAFACL 60 

III:: II: I I I : I I : : I I : I I : I I : I I I I I I I 

Db 12 LALVLATFDPARGTDATNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECF 71 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 12 0 

I I : : I : I : : I I :.| I : : I I I I I I I : I : I : : I I I I : : : hi : I 

Db 72 ENNSCEIRGLHGICMTFLHNAGKFDAQGKSFIKDALKCKAHALRHRFGCISRKCPAI REM 131 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

: : : : I I I I I : : I : I : I I I : ■: : : I II II I h I 
Db 132 VSQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHEPYVDLVNLLLTCGEEVKEAITH 191 

Qy 181 SLMEKIGPNMASLFHILQ-TDHCAQTHPRADFNRRRTNEPQKL 222 

I :: I I I I I I I I I :: I I 

Db 192 SVQVQCEQNWGSLCSILSFCTSAIQKPPTAPPERQPQVDRTKLSRAHHGEAGHHLPEPSS 251 



QY 



223 KVLLRNLRGEEDSPSH 238 
: I : I I III 



Db 



252 RETGRGAKGERGSKSH 2 67 



Search completed: July 19, 2004, 15:34:25 
Job time : 2 0 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



July 19, 2004, 15:30:13 ; Search time 16 Seconds 

(without alignments) 
1484.957 Million cell updates/sec 



Title: US-10-614-990-2 

Perfect score: 1268 
Sequence : 

Scoring table: 



1 MLQN S AVLLVLVI S AS ATHE NLRGEEDSPSHIKRTSHESA 247 

BLOSUM62 
Gapop 10.0 , Gapext 0.5 



Searched: 



283366 seqs, 96191526 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283366 



Database : PIR_78:* 
1: pirl:* 
2: pir2:* 
3: pir3:* 
4: pir4:* 



Pred. No. is the number of results predicted -by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


680.5 


53. 


7 


263 


2 


A54648 


stanniocalcin prec 


2 


662.5 


52. 


2 


256 


2 


151197 


stanniocalcin - co 


3 


354 


27. 


9 


302 


2 


JE0357 


stanniocalcin homo 


4 


150 


11. 


8 


40 


2 


B60841 


teleocalcin - coho 


5 


149 


11. 


8 


40 


2 


A60841 


teleocalcin - sock 


6 


123 


9. 


7 


33 


2 


S06337 


teleocalcin - rain 


7 


102 


. 8. 


0 


473 


2 


T04799 


hypothetical prote 


8 


94 


7. 


4 


1105 


2 


T18295 


Ap-3 adaptor compl 


9 


90.5 


7. 


1 


289 


2 


T50776 


hypothetical prote 


10 


90.5 


7. 


1 


783 


2 


T38690 


probable regulator 


11 


89.5 


7. 


1 


1126 


2 


T01491 


ubiqui tin-protein 


12 


89 


7. 


0 


305 


2 


T08121 


peroxidase (EC 1.1 


13 


88 


6. 


9 


473 


2 


T45954 


hypothetical prote 



14 


87 . 5 


6.9 


1465 


2 


S45628 


DNA-directed DNA p 


15 


86 


6.8 


470 


2 


H72097 


mg++ transporter ( 


16 


86 


6.8 


470 


2 


F86526 


Mg++ transporter [ 


17 


86 


6.8 


510 


2 


T45952 


hypothetical prote 


18 


86 


6.8 


556 


2 


S51858 


probable membrane 


19 


86 


6.8 


581 


2 


S58201 


probable membrane 


20 


83 


6.5 


250 


2 


S30584 


hypothetical prote 


21 


83 


6.5 


250 


2 


D69215 


conserved hypothet 


22 


83 


6.5 


365 


2 


T06693 


hypothetical prote 


23 


83 


6.5 


1093 


2 


T50652 


AP-3 complex beta3 


24 


83 


6.5 


1094 


2 


T50651 


AP3-complex beta-3 


25 


82 . 5 


6.5 


869 


2 


A88710 


protein C43G2.2 [i 


26 


82 . 5 


6.5 


1483 


2 


S30015 


hypothetical prote 


27 


82 . 5 


6.5 


1757 


2 


T05204 


hypothetical prote 


28 


82 . 5 


6.5 


2335 


2 


T40186 


probable phosphati 


29 


82 . 5 


6. 5 


2535 


2 


T04824 


hypothetical prote 


30 


82 


6. 5 


815 


2 


G72209 


conserved hypothet 


31 


82 


6 . 5 


925 


2 


T01384 


hypothetical prote 


32 


81.5 


6.4 


463 


2 


T15416 


hypothetical prote 


33 


81 . 5 


6. 4 


549 


2 


F64640 


conserved hypothet 


34 


81.5 


6. 4 


708 


1 


TFBOL 


lactotransf errin p 


35 


81 


6. 4 


358 


2 


S76692 


hypothetical prote 


36 


81 


6.4 


720 


2 


JC5131 


glycosyltransf eras 


37 


81 


6.4 


1375 


2 


T18961 


FAB1 protein homol 


38 


81 


6 . 4 


4377 


2 


A55575 


ankyrin 3, long sp 


39 


80 


6. 3 


281 


2 


H84720 


probable endonucle 


40 


80 


6. 3 


509 


1 


VGNVPC 


major envelope gly 


41 


80 


6.3 


509 


2 


T10395 


protein gp64 - Org 


42 


80 


6.3 


985 


2 


T00633 


Ca2+- transporting 


43 


80 


6.3 


998 


2 


T52581 


Ca2+- transporting 


44 


80 


6.3 


1178 


2 


S54073 


probable membrane 


45 


80 


6.3 


1188 


2 


T46608 


zinc finger protei 



ALIGNMENTS 



RESULT 1 
A54648 

stanniocalcin precursor - Australian eel 

N;Alternate names: Stannius corpuscle secretory protein 
C;Species: Anguilla australis (Australian eel) 

C;Date: 19-Mar-1997 #sequence__revision 19-Mar-1997 #text_change 07-May-1999 
C; Accession: A5464 8 

R;Butkus, A.; Roche, P.J.; Fernley, R.T.; Haralambidis , J.; Penschow, J.D.; 
Ryan, G.B.; Trahair, J.F.; Tregear, G.W. ; Coghlan, J. P. 
Mol. Cell. Endocrinol. 54, 123-133, 1987 

A;Title: Purification and cloning of a corpuscles of Stannius protein from 
Anguilla australis. 

A/Reference number: A54648; MUID : 88083961 ; PMID:3319739 
A; Accession: A54648 
A; Status: preliminary 
A;Molecule type: mRNA 
A; Residues: 1-263 <BUT> 

F; 1-17 /Domain: signal sequence #status predicted <SIG> 
F;18-263/Product : stanniocalcin #status predicted <MAT> 



Query Match 53.7%; Score 680.5; DB 2; 

Best Local Similarity 61.4%; Pred. No. 2.3e-51; 
Matches 124; Conservative 41; Mismatches 36; 



Length 263; 



Indels 1; Gaps 



1; 



Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKS RVAAQNSAEWRCLNSALQVGCGAFACL 60 

II: I : : I II: : I : I : : : : : I I I : I : I : : : I MM II I I II Mill 
Db 1 MLRMSGLILTLVL-WAAYEQDESEPLSPRTARFSASSPSDVARCLNGALQVGCSAFACL 59 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 12 0 

: II I I : I I I I: : II : II I : II II II II I II II I I II I I I I : I I II I I I I I I I : I I : I 

Db 60 DNSTCNTDGMHEICRSFLHGAAKFDTQGKTFVKESLKCIANGITSKVFLTIRRCSSFQKM 119 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I : I II I II I M I : : I I : I : I I M : II MM I I I I M M ■ : I I I II I I II : I 

Db 120 ISEVQEECYSKLDLCSVAQSNPEAMGEVAQVPSQFPNRYYSTLLQSLLTCDEDTVEQAmA 179 

Qy 181 S LMEKI GPNMASL FHI LQT DHC 202 

M : : I I M : M I I 
Db 180 GLVSRLEPEMGVLFQLLQTKAC 201 



RESULT 2 
151197 

stanniocalcin - coho salmon 

C; Species: Oncorhynchus kisutch (coho salmon) 

C;Date: 13-Sep-1996 #sequence_revision 13-Sep-1996 #text_change 21-Jul-2000 
C;Accession: 151197 

R;Wagner, G.F.; Dimattia, G.E.; Davie, J.R.; Copp, D.H.; Friesen, H.G. 
Mol. Cell. Endocrinol. 90, 7-15, 1992 

A; Title: Molecular cloning and cDNA sequence analysis of coho salmon 
stanniocalcin. 

A; Reference number: '151197; MUID : 93246046; PMID: 1363790 
A;Accession: 151197 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-256 <WAG> 

A; Cross-references: GB:S59519; NID:g299926; PIDN:AAB26419 . 1; PID:g299927 

Query Match 52.2%; Score 662.5; DB 2; Length 256; 

Best Local Similarity 53.4%; Pred. No. 8.1e-50; 

Matches 125; Conservative 46; Mismatches 54; Indels 9; Gaps 4; 

Qy 11 LVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGM 7 0 

IM : I I I I I :: I ::::: I I I I I I I I I I I I I I I I I I I I I I I I I. 

Db 12 LVLGTAATFDTDPEEA-SPRRARFSSNSPSDVARCLNGALAVGCGTFACLENSTCDTDGM 70 

Qy 71 YDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRMIAEVQEECYS 130 

: I I M I :: I I M I I II I II II M II I II I I I I II II I I I II II : I II I I II I 

Db 71 HDICQLFFHTAATFNTQGKTFVKESLRCIANGVTSKVFQTIRRCGVFQRMISEVQEECYS 130 

Qy 131 KLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNM 190 

: I: : I : I : Mill II I I : I II M I M I : : I I I II M I M : I M : : I M I 

Db 131 RL D I C GVARS N P EAI G EVVQVP AH FPNRYYSTLLQS LLAC D E ET VAWRAGL VARLGP DM 190 

Qy 191 ASLFHILQTDHCAQTHPRADFN RRRTNEPQKLKVLLRNLRGEEDSPSHI 239 

: I I : I I III : : I I I : : : M : MM 

Db 191 ETLFQLLQNKHCPQGSNQGPNSAPAGWRWPMGSPPSFKI-QPSMRGRD — PTHL 241 



RESULT 3 
JE0357 

stanniocalcin homolog - human 

N;Alternate names: STC2 

C; Species: Homo sapiens (man) 

C;Date: 05-Feb-1999 #sequence_revision 05-Feb-1999 #text_change 21-Jul-2000 
C;Accession: JE0357 

R;Ishibashi, K. ; Miyamoto, K. ; Taketani, Y. ; Morita, K . ; Takeda, E. ; Sasaki, S.; 
Imai, M. 

Biochem. Biophys . Res. Commun. 250, 252-258, 1998 

A; Title: Molecular cloning of a second human stanniocalcin homologue (STC2) . 
A;Reference number: JE0357; MUID : 9844 0784 ; PMID:9753616 
A;Accession: JE0357 
A; Molecule type: mRNA 
A; Residues: 1-302 <ISH> 

A; Cross-references : DDBJ: AB012664 ; NID : g3702223 ; PIDN : BAA33489 . 1 ; PID:g3702224 
C; Comment: This protein suppressed expression of renal sodium/phosphate 
cotransporter . 
C; Genetics : 

A;Introns: 51/1 98/3 169/2 

Query Match 27.9%; Score 354; DB 2; Length 302; 

Best Local Similarity 32.4%; Pred. No. 4.6e-23; 

Matches 83; Conservative 45; Mismatches 102; Indels 26; Gaps 4; 

Qy 9 LVLVIS AS AT HEAE- QN D S VS P RKS RVAAQN S AE WRCLN S ALQVGC GAFACL 60 

III:: II: I I I : I I : : I I : I I : I I : I I I I I I I 

Db 12 IALVLATFDPARGTDATNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECF 71 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I :: I : I : : I I : I I : : I I I I I I I : I : I : : I I I I : : : hi : I 

Db 72 ENNSCEI RGLHGI CMT FLHNAGKFDAQGKS FI KDALKCKAHALRHRFGCI SRKCPAI REM 131 

Qy 121 IAEVQEECYSKLNVCSI7VKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

: : : : I I I I I : : I : I : I I I : : : : I II I I I I : I 
Db 132 VSQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHEPYVDLVNLLLTCGEEVKEAITH 191 

Qy 181 SLMEKIGPNMASLFHILQ-TDHCAQTHPRADFNRRRTNEPQKL 222 

I :: -I I I I I III I : : I I 

Db 192 SVQVQCEQNWGSLCSILSFCTSAIQKPPTAPPERQPQVDRTKLSRAHHGEAGHHLPEPSS 251 

Qy 223 KVLLRNLRGEEDSPSH 238 

: I : I I I I I 
Db 252 RETGRGAKGERGSKSH 267 



RESULT 4 
B60841 

teleocalcin - coho salmon (fragment) 

C; Species: Oncorhynchus kisutch (coho salmon) 

C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 30-Sep-1993 
C;Accession: B60841 

R;Wagner, G.F.; Fenwick, J.C.; Park, CM.; Milliken, C. ; Copp, D.H.; Friesen, 
H.G. 

Gen. Comp. Endocrinol. 72, 237-246, 1988 



A; Title: Comparative biochemistry and physiology of teleocalcin from sockeye and 
coho salmon. 

A; Reference number: A60841; MUID: 89065334 ; PMID: 3197944 
A; Accession: B608 41 
A;Molecule type: protein 
A; Residues: 1-40 <WAG> 

C;Comment: This glycoprotein hormone from the corpuscles of Stannius regulates 
calcuim uptake through the gills. 

C; Keywords: disulfide bond; glycoprotein; hormone 

Query Match 11.8%; Score 150; DB 2; Length 40; 

Best Local Similarity 66.7%; Pred. No. 1.9e-06; 

Matches 2 6; Conservative 6; Mismatches 7; Indels 0; Gaps 0; 

Qy 35 AAQN S AE WRC LN S AL Q VG C G AFAC LENSTCDTD GM YD I 73 

::::: I I I I I I I I I I I I I I I I I 1 I I I I I I : I I 

Db 2 SSNSP5DVARCLNGALAVGCGTFACLEXSTCDTDGMHDI 40 



RESULT 5 
A60841 

teleocalcin - sockeye salmon (fragment) 

C; Species: Oncorhynchus nerka (sockeye salmon) 

C;Date: 30-Sep-1993 #sequence_revision 30-Sep-1993 #text_change 30-Sep-1993 
C;Accession: A60841 

R;Wagner, G.F.; Fenwick, J.C.; Park, CM. ; Milliken, C; Copp, D.H.; Friesen, 
H.G. 

Gen. Comp. Endocrinol. 72, 237-246, 1988 

A; Title: Comparative biochemistry and physiology of teleocalcin from sockeye and 
coho salmon. 

A;Reference number: A60841; MUID : 89065334 ; PMID: 3197944 
A;Accession: A60841 
A;Molecule type: protein 
A; Residues: 1-4 0 <WAG> 

C; Comment: This glycoprotein hormone from the corpuscles of Stannius regulates 
calcuim uptake through the gills. 

C; Keywords: disulfide bond; glycoprotein; hormone 

Query Match 11.8%; Score 149; DB 2; Length 40; 

Best Local Similarity 76.5%; Pred. No. 2.3e-06; 

Matches 26; Conservative 3; Mismatches 5; Indels 0; Gaps 0; 

Qy 4 0 AEWRCLNSALQVGCGAFACLENSTCDTDGMYDI 7 3 

: : I I I I I I I I I I I I I I I I I I.I I I I I I : I I 

Db 7 SDVARCLNGALDVGCGTFACLEXSTCDTDGMHDI 40 



RESULT 6 
S06337 

teleocalcin - rainbow trout (fragment) 
N; Alternate names: hypocalcin 

C; Species: Oncorhynchus mykiss (rainbow trout) 

C;Date: 29-Jan-1993 #sequence_revision 29-Jan-1993 #text_change 18-Jun-1993 
C;Accession: S06337 

R;Lafeber, F. P . J. G. ; Hanssen, R.G.J.M.; Choy, Y.M.; Flik, G. ; Herrmann-Erlee, 
M.P.M.; Pang, P.K.T.; Wendelaar Bong'a, S.E. 
Gen. Comp. Endocrinol. 69, 19-30, 1988 



A;Title: Identification of hypocalcin ( teleocalcin) isolated from trout stannius 
corpuscles. 

A; Reference number: S06337; MUID: 88196801 ; PMID: 3360288 

A/Accession: S0.6337 

A;Molecule type: protein 

A; Residues: 1-33 <LAF> 

A; Note: 7-Glu was also found 

C;Keywords: dimer; glycoprotein; hormone 

F;29/Binding site: carbohydrate (Asn) (covalent) #status experimental 

Query Match 9.7%; Score 123; DB 2; Length 33; 

Best Local Similarity 65.6%; Pred. No. 0.00032; 

Matches 21; Conservative 5; Mismatches 6; Indels 0; Gaps 0; 

35 AAQN S AE WRC LN S ALQ VG C GAFAC L EN S T C D 66 
: : : : : I I I I I I I I I I I I I I I I I I I I I 

2 S S N S P S D VARC LN GALAVGC GT FAC L EN S T C D 33 



Qy 

Db 



RESULT 7 
T04799 

hypothetical protein F10M23.100 - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 23-Apr-1999 #sequence_revision 23-Apr-1999 #text_change ll-Jun-1999 
C;Accession: T04799 

R; Be van, M. ; Lecharny, A.; Chefdor, F. ; Krivitzky, M. ; Kreis, M. ; Hoheisel, J. ; 
Mewes, H.W.; Mayer, K.F.X.; Schueller, C. 

submitted to the Protein Sequence Database, February 1999 

A; Reference number: Z15385 

A;Accession: T04799 

A;Molecule type: DNA 

A; Residues: 1-473 <BEV> 

A;Cross-references: EMBL : AL035440 

A; Experimental source: cultivar Columbia; BAC clone F10M23 
C; Genetics : 
A;Map position: 4 

A;Introns: 24/3; 92/3; 184/3; 256/3; 308/3; 412/3 
A; Note: F10M23.100 

Query Match 8.0%; Score 102; DB 2; Length 473; 

Best Local Similarity 21.0%; Pred. No. 0.49; 

Matches 50; Conservative 43; Mismatches 67; Indels 78; Gaps 11; 

S VS P RK S RVAAQN S AE WRC LN S ALQ VG C GAFAC L EN S T C D T DGMYDICKSF-LYS 80 

: I : I : | | : : | | : : : | : I : I I I I I I I I : 

NVYKKKVELAAKSRAELLQTLSD ATVELSNLTTALGEKSYIDICDSMSLFP 100 

AAKFDTQGKAFVKESLKCIA NG — 102 

11:11111 I I 



'LAI RRCST FQRMI AEVQEEC YS KL NVCSIAKRN-PEAITEV — 148 

I : :: I I I I : I : I : I : I :: | : : : : : I I I 



Qy 


26 


Db 


50 


Qy 


81 


Db 


101 


Qy 


103 


Db 


159 


Qy 


149 



-VQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHILQT 199 
II : II I I : : : I I I : : : : : I : : : : I 



Db 219 SLDEANGVQTKS-ISNETLARLAKTVLTLKEDKMQRLKK--LQELATQLTDLWNLMDT 273 



RESULT 8 
T18295 

Ap-3 adaptor complex beta3A chain - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 15-Oct-1999 #sequence_revision 15~Oct-1999 #text_change 15-Oct-1999 
C;Accession: T18295 

R;Feng, L.; Seymour, A.B.; Jiang, S.Y.; To, A.; Peden, A. A. ; Novak, E.K.; Zhen, 
L.; Rusiniak, M.E.; Eicher, E.M. ; Robinson, M.S.; Gorin, M.B.; Swank, R.T. 
Hum. Mol. Genet. 8, 323-330, 1999 

A;Title: The beta3A subunit gene (Ap3bl) of the AP-3 adaptor complex is altered 
in the mouse hypopigmentation mutant pearl, a model for Hermansky-Pudlak 
syndrome and night blindness. 

A; Reference number: Z18864; MUID : 99135912 ; PMID: 9931340 
A;Accession: T18295 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-1105 <FEN> 

A; Cross-references: EMBL : AF103809; NID : g3885987 ; PID : g3885988 ; PIDN : AAC78338 . 1 

A; Experimental source: strain C3H/HeJ 

C; Genetics : 

A; Gene: Ap3bl 

A; Map position: 13 

Query Match 7.4%; Score 94; DB 2; Length 1105; 

Best Local Similarity 21.2%; Pred. No. 6.5; 

Matches 55; Conservative 40; Mismatches 104; Indels 60; Gaps 11; 

1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 
: I I : : | : : : \ : I I : I I : I I I I ■ : I : 

308 LLQSRNAAWMAVAQLYWH ISPKSE — AGVISKSLVRLLRSNREVQYIVLQNI 358 

61 ENSTCDTDGMYD-ICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFL 109 

: : I I :: MM: II I I : I I I 

359 ATMSIERKGMFEPYLKSF-YVRSTDPTMIKTLKLEILTNLANEANISTLLREFQTYVRSQ 417 

110 AI RRCSTFQRMI AEVQEECYS KLNVCS I AKRNPEAITE WQLPNHFS 156 

111:1 I : I I I : : I I I : : I : : I I : : 

418 D KQ FAAAT I Q T I GRCAT SISEFTETCFNGL-VCLLSNRDEIWAES WVIKKLLQMQ 473 

157 NRYYNRLVRSLLE-CDEDTVSTIRDSLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRR 215 

: : : I : : I I I I I : : I I I I : I 
474 PAQHGEI I RHMAKFLD S I TVPVARAS I LWLI GEN CERVPKIA 515 

216 TNEPQKLKVLLRNLRGEED 234 

I I: : :: |:| 
516 PDVLRKMAKSFTSEDD 531 



RESULT 9 
T50776 

hypothetical protein [imported] - Vitis vinifera 
C; Species: Vitis vinifera 

C;Date: 21-Jul-2000 #sequence_revision 21-Jul-2000 #text_change 21-Jul-2000 
C; Accession: T50776 



Qy 

Db 

QY 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 



R;Matsumoto, S.; Dry, I.B.; Thomas, M. 
DNA Seq. 8, 109-112, 1997 

A;Title: Nucleotide sequence of grapevine (Vitis vinifera) cDNA similar to SNAP 
proteins . 

A; Reference number: Z25233 
A;Accession: T50776 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-289 <MAT> 

A/Cross-references : EMBL : AB001375 ; PIDN : BAA19246 . 1 
A; Experimental source: strain Shiraz 

Query Match 7.1%; Score 90.5; DB 2; Length 28 9; 

Best Local Similarity 19.9%; Pred. No. 2.7; 

Matches 41; Conservative 33; Mismatches 95; Indels 37; Gaps 5; 

Qy 8 LLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEVVRCLNSALQVGCGAFACLENSTCDT 67 

I : : I : : I I I I : : : : I : I I I I : I : I : 

Db 5 9 LSTVIQSSDSKHEAAQ--AYADAGHCYKKTSAKEAISCLEQA AYLFLDNGRFNM 110 

Qy 68 DG MYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQR 119 

I : | : : : : | : | : : | | | : I I : : | : 

Db 111 AGKYYKEIAELYELEQNFEQAIIYFEKAADIYQSEEATTAANQCNAKVAQFAAQLEQYQK 17 0 

Qy 12 0 MI AEVQEEC YS KLN VCS1AKRNPEAITEWQL PNHFSNR 15 8 

I : : II : I : I : I I I : I I 

Db 171 AIQIYEDIGRPSLNNNLLKYGVKGHLLNAGICQLCKGDWAITNALDRYQEMDPTFSGTR 230 



Qy 159 YYNRLVRS LLECDEDTVST I RDS LME 184 

I II II: I I : : I 

Db 231 EYKLLVDLAAAVDEEDWKFTDAVKE 256 



RESULT 10 
T38690 

probable regulatory protein - fission yeast ( Schizosaccharomyces pombe) 
C; Species: Schizosaccharomyces pombe 

C;Date: 03-Dec-1999 #sequence_revision 03-Dec-1999 #text_change 09-Jun-2000 
C; Accession: T38690 

R; Brown, D.; Churcher, CM. ; Wood, V.; Barrell, B.G.; Rajandream, M.A. 
submitted to the EMBL Data Library, September 1997 
A; Reference number: Z21805 
A; Accession: T38 690 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues : 1-7 83 <BRO> 

A; Cross-references: EMBL:Z99568; PIDN: CAB1 6735 . 1; GSPDB: GN00066; SPDB : SPAC3C7 . 04 
A; Experimental source: strain 972h-; cosmid c3C7 
C; Genetics : 

A; Gene: SPDB : SPAC3C7 . 04 
A; Map position: 1 

C; Superf amily : unassigned GAL4-type zinc cluster proteins; GAL4 zinc binuclear 
cluster homology 

F; 35-71/Domain: GAL 4 zinc binuclear cluster homology <GL4> 



Query Match 

Best Local Similarity 



7.1%; 
23.3%; 



Score 90.5; DB 2; Length 7 83; 
Pred. No. 8.7; 



Matches 56; Conservative 34; Mismatches 87; Indels 63; Gaps 



12; 



Qy 10 VLVI SASATHEAEQNDSVSPRKSRV AAQNSAEWRCLNSALQVGCGAF 57 

: : : : I : I III I : I I : : : I I : I : I : I . 

Db 535 IIIMSRPVLLHKMKNAKNSPRVDRINEDCILAARHLISLVHLLQNHSQLSCYSFFDYNYT 594 

Qy 58 ACLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCI 99 

I : I : I II : I I I : : I I ' : I 

Db 595 FSSALWLLHCV-TEPCEED DIAMQYAYSALDYMAEGNEAAKNCARVIRLFDAHLKG 650 

Qy 100 ANGVT S KV- FLAI RRC ST FQRMI AEVQ EECYSKLNVCSIAKRNPEAITEW 149 

Mil: ] : I : II I I I I I : I I I I : : I 

Db 651 ARSDGNGNTSQSGFMA WQRWIAEVSAKDEPEKLMS P YNKS I GGGRNSNSLTPNA 704 

Qy 150 QLPNHFSNRYYNR LVRSLLECDEDTVSTIRDSLMEKIGPNMA SLFHILQTD 200 

| | : : | | | : | : : : | : | | : : | I I : II 

Db 705 NLGADVS — FFPTDDTSFLLDHSKLDDDLEKFASTLDPI — KTTPDLANDSSLLNWANTD 7 60 



RESULT 11 
T01491 

ubiquitin-protein ligase homolog F1707.15 - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 12-Feb-1999 #sequence_revision 12-Feb-1999 #text_change 04-Mar-2000 
C;Accession: T01491 

R;Vysotskaia, V.S.; Schwartz, J.R.; Toriumi, M. ; Yu, G. ; Kwan, A.; Oji, O.; Liu, 
S.; Li, J.; Araujo, R. ; Au, M. ; Brendel, V.; Buehler, E . ; Conway, A. B .;. Conway, 
A.R.; Dewar, K. ; Feng, J.; Kim, C; Kurtz, D. ; Li, Y. ; Palm, C.J.; Shinn, P.; 
Sun, H. ; Davis, R.W. ; Ecker, J.R.; Federspiel, N.A.; Theologis, A. 
submitted to the EMBL Data Library, June 1998 

A; Description: Arabidopsis thaliana chromosome 1 BAC F1707 sequence. 
A; Reference number: Z14334 
A/Accession: T01491 

A; Status: translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-1126 <VYS> 

A; Cross-references: EMBL : AC003671 ; NID : g2833627 ; PID : g3176690 ; GSPDB : GN00059 ; 
ATSP:F1707.15 

A; Experimental source: cultivar Columbia 

C; Genetics : 

A; Gene: ATSP : F1707 . 15 

A; Map position: 1 

A;Introns: 118/3; 224/3; 292/1; 373/3; 457/2; 826/3; 875/3; 902/3; 1057/3; 
1093/2 

C; Super family: ubiquitin-protein ligase homology 
F;756-1120/Domain: ubiquitin-protein ligase homology <UBI> 

Query Match 7.1%; Score 89-5; DB 2; Length 1126; 

Best Local Similarity 22.1%; Pred. No. 16; 

Matches 46; Conservative 39; Mismatches 76; Indels 47; Gaps 8; 

Qy 14 SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCD ' 66 

I : : || I I : I I III: III: : : I : 

Db 412 SSETQKDAESELSVARRK — NCAEL YNIFLQLP QSDLCNLCMLLGY 455 

Qy 67 TDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRMIAE 123 



Db 456 EGLSDKIYSLAGEVLKKLAAVDVTHRKFFTKELSELASGLSSSTVRVLATLSTTQKM 512 



Qy 124 VQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIR 179 

111:1 : I I : I : : : : : I : I I : : : ' ' 

Db 513 SQNTCSMA GAS I LRVLQVLS SLTST I DDSNVGTDKETDQEEQNIMQGLKV 562 

Qy 180 — DSLMEKIGPNMASLFHILQTDHCAQT 205 

: I : : : I : : I I I I I I 

Db 563 ALEPLWQELGQCIS — MTELQLDHTAAT 588 



RESULT 12 
T08121 

peroxidase (EC 1.11.1.7) - flax (fragment) 
C; Species: Linum usitatissimum (flax) 

C;Date: 21-May-1999 #sequence_revision 21-May-1999 #text_change 04-Mar-2000 
C;Accession: T08121 
R;Omann, F. ; Tyson, H. 

submitted to the EMBL Data Library , February 199 8 

A; Description: cDNA sequence of a basic peroxidase (FLXPER4) in flax. 
A; Reference number: Z16366 
A;Accession: T08121 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-305 <OMA> 

A;Cross-references: EMBL : AFO 4 98 81; NID : g2944416; PIDN : AAC05277 . 1 ; PID:g2944417 
A; Experimental source: cv. Stormont Cirrus 
C; Genetics : 
A; Gene: PER4 

C; Super family: peroxidase 

C;Keywords: chromoprotein; heme; iron; metalloprotein; oxidoreductase 
F;23-103/Disulfide bonds: #status predicted 
F;50/Active site: Arg #status predicted 

F;54, 181/Binding site: heme iron (His) (axial ligands) #status predicted 
F; 56-61/Disulf ide bonds: #status predicted 
F; 109-301/Disulf ide bonds: #status predicted 
F; 188-2 13/Disulf ide bonds: #status predicted 

Query Match 7.0%; Score 89; DB 2; Length 305; 

Best Local Similarity 21.9%; Pred. No. 3.8; 

Matches 57; Conservative 39; Mismatches 92; Indels 72; Gaps 15; 

ISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGA-FACLE-NSTCDTD 68 
: : I I 1:1 I I ::::::: I I : I : : I 

LDDTAT FTGEKN — AGPNQNSV RGFDI I DT I KT RVEAACN AT VS CAD I LALAARD 119 

YDI CKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLA 110 

: : : I I I I I I I I : I I 



QY 


11 


Db 


65 


Qy 


69 


Db 


120 


Qy 


111 


Db 


177 


Qy 


153 


Db 


230 



IRRCSTFQRMIAEVQEECYSKLNV- CSIAKRN-PEA ITEWQLP 152 

I I : I I : : I I: h : : I |:| : : I 

1 GAHT I GQARCTT FRQRI YN DTN I D P AFAT T RRGN C P QAGAG AN LAP L D GT P 229 

[FSNRYYNRLV — RSLLECDEDTVST-IRDSLMEKIGPNMASLFHILQTDHCAQ 204 

I I I II II III I : : : : I : I : I h M I 



205 THPRADFNRRRTN 217 

I : I I I I I 

286 NISPLTGTNGEIRFNCRRPN 305 



RESULT 13 
T45954 

hypothetical protein F7J8.120 - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date': 04-Feb-2000 #sequence_revision 04-Feb-2000 #text_change 20-Jun-2000 
C /Accession: T45954 

R;Bevan, M. ; Zimmermann, W.; Grueneisen, A.; Wambutt, R. ; Bancroft, I.; Mewes, 
H.W.; Lemcke, K. ; Mayer, K.F.X. 

submitted to the Protein Sequence Database, January 2000 

A; Reference number: Z23018 

A;Accession: T45954 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-473 <BEV> 

A;Cross-references : EMBL: AL137189 

A; Experimental source: cultivar Columbia; BAC clone F7J8 

C; Genetics: 

A; Map position: 5 

A;Introns: 29/1; 428/3 

A;Note: F7J8.120 

C;Superfamily: Arabidopsis thaliana hypothetical protein F7J8.130 

Query Match 6.9%; Score 88; DB 2; Length 473; 

Best Local Similarity 22.6%; Pred. No. 7.9; 

Matches 54; Conservative 42; Mismatches 103; Indels 40; Gaps 13; 

27 VS P RK S RVAAQN S AE WRC LN S AL Q VGC GAF AC L EN - - - S T C D TDGMYDICKS-FLY 79 

: |.:: | : :|| I : III II I Mi ll 

16 I D E E KN KWLAEAG T I VRL LAKH RK S D P VT IGCLRNLYT SWDME L D D F ET D AC KQML L Y 75 

80 SAAKFDTQGKAF VKESLKCIANGVTSKVFLAIRRCSTFQRMIAE — - — VQEEC 128 

: I : I Mill : I I I I I : : : I I 

76 PKNIREAQYRNFKLNIDTNESLKCFG— —CRFFSICRMCSNFNTSLCKCGKLMNEEISF 131 

12 9 — YSKLNVCSIAKRNPEA — ITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLME 184 

| :" : I : I : : | | : ' : : I : :. : | : : : | : . I I : I : : : : 

132 LEYEENDVEGVFMRDKSSFIITDDLRLTDDST SSLLQTLKDLGCADVSKLREQVLD 187 

185 KIGPNMASLFHILQTDHCAQTH — PRAD — FNRRRTNEPQKLKVLLRNLRGEEDSPSHI 239 

|| | : : II II : : | : I : I . : : I : I I ' : 

188 -IG LKEVMTLMQCVFTSNTPLTDAFLKNQSSNTVRKIYRKLSDDKGDEAEPDKV 240 



QY 
Db 



Qy 

Db 

QY 
Db 

Qy 

Db 

Qy 

Db 



RESULT 14 
S45628 

DNA-directed DNA polymerase (EC 2.7.7.7) alpha 180K chain - mouse 
N;Alternate names: DNA polymerase alpha/DNA primase complex 180K chain 
C; Species: Mus musculus (house mouse) 

C;Date: 13-Jan-1995 #sequence_revision 13-Jan-1995 #text_change 27-Oct-2003 
C;Accession: S45628; A46642 



R; Stadlbauer, F. ; Brueckner, A.; Rehfuess, C; Eckerskorn, C. ; Lottspeich, F. ; 
Foerster, V.; Tseng, B.Y.; Nasheuer, H.P. 
Eur. J. Biochem. 222, 781-793, 1994 

A; Title: DNA replication in vitro by recombinant DNA-polymerase-alpha-primase . 

A; Reference number: S45628; MUID: 94298818 ; PMID: 8026492 

A;Accession: S45628 

A; Status: preliminary 

A; Molecule type: mRNA 

A; Residues: 1-1465 <STA> 

A;Cross-references : EMBL : D17384 ; NID:g397829; PIDN : BAA04202 . 1 ; PID:g442471 
R;Miyazawa, H.; Izumi, M. ; Tada, S.; Takada, R. ; Masutani, M. ; Ui, M. ; Hanaoka, 
F. 

J. Biol. Chem. 268, 8111-8122, 1993 

A; Title: Molecular cloning of the cDNAs for the four subunits of mouse DNA 
polymerase alpha-primase complex and their gene expression during cell 
proliferation and the cell cycle. 

A; Reference number: A46642; MUID : 93216788 ; PMID: 8463324 

A; Accession : A46642 

A; Status: preliminary 

A; Molecule type: mRNA; protein 

A; Residues: 4-1465 <MIY> 

A; Experimental source: FM3A cells 

A; Note: sequence extracted from NCBI backbone (NCBIN : 129146, NCBIP : 129147 ) 
C; Superf amily : DNA polymerase 

C; Keywords: DNA binding; nucleotidyltransferase; nucleus 

Query Match 6.9%; Score 87.5; DB 2; Length 1465; 

Best Local Similarity 20.7%; Pred. No. 33; 

Matches 34; Conservative 32; Mismatches 55; Indels 43; Gaps 7; 

Qy 6 AVLLVLVI S AS AT - HEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFA 58 

I I I : I : : I I : I : I I : : : : : : : II 

Db 1236 AVLIALWLGLDSTQFRVHQYHKDEENDALLGGPAQLTDEEKYK DCEKFK 1284 

Qy 59 CLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVT SKVFLAIR 112 

II I I I : : I I : : : | : : : : | : : | : I : : I I 

Db 12 85 CLCPS-CGTENIYD :-NVFEGSGLDMEPSLYRCSNVDCKVSPLTFMVQLSNKLIMDIR 1339 

Qy 113 RCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFS 156 

II : : I : I I : ': : I I I I I 

Db 1340 RCI — — KKYYDGWLIC EEPTCCSRLRRLPLHFS 1369 



RESULT 15 
H72097 

mg++ transporter (cbs domain) - Chlamydophila pneumoniae (strain CWL029) 
C; Species: Chlamydophila pneumoniae, Chlamydia pneumoniae 

C;Date: 23-Apr-1999 #sequence_revision 23-Apr-1999 #text_change 05-May-2000 
C;Accession: H72097 

R;Kalman, S.; Mitchell, W. ; Marathe, R. ; Lammel, C; Fan, J.; Olinger, L.; 
Grimwood, J.; Davis, R.W. ; Stephens, R.S. 
Nature Genet. 21, 385-389, 1999 

A; Title: Comparative genomes of Clamydia pneumoniae and C. trachomatis. 
A; Reference number: A72000; MUID: 99206606; PMID : 10192388 
A;Accession: H72097 
A; Status: preliminary 
A;Molecule type: DNA 



A; Residues: 1-470 <ARN> 

A;Cross-references: GB:AE001613; GB:AE001363; NID : g4376550 ; PIDN : AAD18435 . 1 ; 
PID:g4376560 

A; Experimental source: strain CWL029 
C; Genetics : 
A; Gene: mgtE 

C; Superf amily : magnesium transport protein mgtE 

Query Match 6.8%; Score 86; DB 2; Length 470; 

Best Local Similarity 22.9%; Pred. No. 12; 

Matches 47; Conservative 33; Mismatches 61; Indels 64; Gaps 13; 

Qy 56 AFACLENSTCDTD-GMYDICK SFLYSAAKFDTQGKAFVKESLKCIANGVTSK 106 

MM M M : I I : : : : : I : : : I M Ml 

Db 19 AFTCL STDIHSHDLSKIVIEYNPIDLAYAVSCLPSESRAILYKNLSCI TAK 69 

Qy 107 VFLAIR RCSTFQRMIAEVQEECYSKLNVCSIAKRNP — EAITEWQLPNHFSNRY 159 

I I I : I : 1 : I M : : : : I M i : : I : I 

Db 70 VAFI INTDSASRWAI FRRL- SDSEVCALIEQMPPDEAVWVLDDIP DRR 116 

Qy 160 YNRLVRSLLECDEDTVSTIRDSLMEKIGPNMA SLFHIL- QTDHCAQTHP 207 

I I : : : I M I : : I I I I II I : : : I 

Db 117 YRRILELI — DSKKALKIRD— LQKHGRNTAGRLMTNEFFAFLMETTVKDVSACIRSNP 171 

Qy 2 08 RADFNRRRTNEPQKLKVLLRNLRGE 232 

II | : : : | | 

Db 172 GIDLTR LVFVLDFKGE 187 



Search completed: July 19, 2004, 15:33:57 
Job time : 19 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 



July 19, 2004, 15:33:34 ; Search time 46 Seconds 

(without alignments) 
1678.315 Million cell updates/sec 

US-10-614-990-2 
1268 

1 MLQNSAVLLVLVI SASATHE NLRGEEDSPSHIKRTSHESA 247 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



1285345 seqs, 312560633 residues 



1285345 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA: * 

1: /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep: * 

2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep:* 

3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep: * 

4: /cgn2_6/ptodata/2/pubpaa/US06J?UBCOMB.pep:* 

5 : / cgn2_6/ptodata/2/pubpaa/US 07NEWPUB . pep : * 

6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep: 

7: /cgn2_6/ptodata/2/pubpaa/US08__NEW_PUB.pep: * 

8 : /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 

9: /cgn2__6/ptodata/2/pubpaa/US09A_PUBCOMB.pep: 

10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep 

11:. /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep 

12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep: 

13: /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep 

14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep 

15: /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep 

16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: 

17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB . pep : 

18: /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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ALIGNMENTS 



RESULT 1 

US-09-840-989A-2 

; Sequence 2, Application US/09840989A 

; Patent No. US20020042372A1 

; GENERAL INFORMATION: 

; APPLICANT: Olsen et al. 



; TITLE OF INVENTION: Stanniocalcin Polynucleotides , Polypeptides,, and Methods 
Based Thereon 

FILE REFERENCE: PF108P2 

CURRENT APPLICATION NUMBER: US/09/840, 989A 
CURRENT FILING DATE: 2001-04-25 
PRIOR APPLICATION NUMBER: PCT/US00/29432 
PRIOR FILING DATE: 2000-10-26 
PRIOR APPLICATION NUMBER: US 60/161,740 
PRIOR FILING DATE: 1999-10-27 
NUMBER OF SEQ ID NOS : 12 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 2 
LENGTH: 247 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-840-989A-2 

Query Match 100.0%; Score 1268; DB 9; Length 247; 

Best Local Similarity 100.0%; Pred. No. 5.5e-125; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEVVRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I I I 

Db 241 RTSHESA 247 



RESULT 2 

US-09-361-736-10 

; Sequence 10, Application US/09361736 
; Patent No. US20020102634A1 
; GENERAL INFORMATION: 

APPLICANT: OLSEN, ET AL. 

TITLE OF INVENTION: Human Stanniocalcin-alpha 
NUMBER OF SEQUENCES: 10 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
ADDRESSEE: CECCHI, STEWART & OLSTEIN 
STREET: 6 BECKER FARM ROAD 
CITY: ROSELAND 
; STATE: NEW JERSEY 

COUNTRY: USA 



ZIP: 07068 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH DISKETTE 
COMPUTER: IBM PS/2 
OPERATING SYSTEM: MS-DOS 
SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/361, 736 
FILING DATE: 
CLASSIFICATION: 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/460,529 
FILING DATE: 
ATTORNEY/AGENT INFORMATION: 
NAME: MULLINS, J.G. 
REGISTRATION NUMBER: 33,073 

REFERENCE/ DOCKET NUMBER: 325800-334 (PF143) 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 201-994-1700 
TELEFAX: 201-994-174 4 
INFORMATION FOR SEQ ID NO: 10: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 247 AMINO ACIDS 
TYPE: AMINO ACID 
STRANDEDNESS: 
TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-09-361-736-10 

Query Match 100.0%; Score 1268; DB 9; Length 247; 

Best Local Similarity . 100.0%; Pr.ed. No. 5.5e-125; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MLQN SAVLLVLVI SASATHEAEQNDSVS P RK S RVAAQN S AEWRC LN S ALQVGC GAFAC L 6 0 

ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

RTSHESA 247 

I II I II I 

RTSHESA 247 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy ' 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 



RESULT 3 
US-10-116-051-2 

; Sequence 2, Application US/10116051 



; Publication No. US20020146791A1 

; GENERAL INFORMATION: 

; APPLICANT: Olsen et al. 

; TITLE OF INVENTION: CORPUSCLES OF STANNIUS PROTEIN, STANNIOCALCIN 
; FILE REFERENCE: PF108P1D1C1 

; CURRENT APPLICATION NUMBER: US/10/ 116, 051 

; CURRENT FILING DATE: 2002-04-05 

; PRIOR APPLICATION NUMBER: 09/312,610 

; PRIOR FILING DATE: 1999-05-17 

; PRIOR APPLICATION NUMBER: 08/431,117 

; PRIOR FILING DATE: 1995-04-28 

; PRIOR APPLICATION NUMBER: 08/208,005 

; PRIOR FILING DATE: 1994-03-08 

; NUMBER OF SEQ ID NOS : 10 

; SOFTWARE: Patentln version 3.1 

; SEQ ID NO 2 

LENGTH: 2 47 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-116-051-2 

Query Match 100.0%; Score 1268; DB 13; Length 247; 

Best Local Similarity 100.0%; Pred. No. 5.5e-125; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 1 MLQNSAVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGWSKVFLAI RRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I i I I 

Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 



Qy 241 RTSHESA 247 

II I I I I I 

Db 241 RTSHESA 247 



RESULT 4 

US-10-177-293-441 

Sequence 441, Application US/10177293 
Publication -No. US20030124 128A1 
GENERAL INFORMATION: 
APPLICANT: Lillie, James 
APPLICANT: Glatt, Karen 
APPLICANT: Zhao, Xumei 
APPLICANT: Gannavarpu, Manjula 
APPLICANT: Kamatkar, Shubhangi 
APPLICANT: Mertens, Maureen 



APPLICANT: Myer f Vic 
APPLICANT: Wang, Youzhen 
APPLICANT: Xu, Yongyao 
APPLICANT: Hoersch, Sebastian 
APPLICANT: Monahan, John 
APPLICANT: Meyers, Rachel E. 
APPLICANT: Bast Jr., Robert C. 
APPLICANT: Hortobagyi, Gabriel N. 
APPLICANT: Pusztai, Lajos 
APPLICANT: Meric, Funda 
APPLICANT:' Sahin, Aysegul 
APPLICANT: Mills, Gordon B. 

TITLE OF INVENTION: COMPOSITIONS, KITS, AND METHODS FOR IDENTIFICATION, 
ASSESSMENT, 

TITLE OF INVENTION: PREVENTION, AND THERAPY OF BREAST CANCER 
FILE REFERENCE: MRI-038 

CURRENT APPLICATION NUMBER: US/10/177 , 293 
CURRENT FILING DATE: 2002-06-21 
PRIOR APPLICATION NUMBER:' US 60/299, 887 
PRIOR FILING DATE: 2001-06-21 
PRIOR APPLICATION NUMBER: US 60/301,572 
PRIOR FILING DATE: 2001-06-27 
. PRIOR APPLICATION NUMBER: US 60/306,501 
PRIOR FILING DATE: 2001-07-18 
PRIOR APPLICATION NUMBER: US 60/325,002 
PRIOR FILING DATE: 2001-09-25 
PRIOR APPLICATION NUMBER: US 60/362,585 
PRIOR FILING DATE: 2002-03-05 
PRIOR APPLICATION NUMBER: US 60/xxx,xxx 
PRIOR FILING DATE: 2002-05-14 
NUMBER OF SEQ ID NOS : 506 

SOFTWARE : FastSEQ for Windows Version 4.0 
SEQ ID NO 441 
LENGTH: 2 47 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-177-293-441 

Query Match 100.0%; Score 1268; DB 14; Length 247;- 

Best Local Similarity 100.0%; Pred. No. 5.5e-125; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

1 1 I 1 1 1 1 1 1 1 1 I I I 1 1 1 1 1 I I I I I ! I I I I I I 1 1 I 1 1 I 1 1 1 1 1 I 1 1 1 1 1 1 I 1 1 1 1 1 1 I I I I 

MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 
ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 12 0 

I I I I I I I I I I I I I I I I II M II I'M I II I I I M I I I I II I I II I I I I I 

ENSTCDTDGMYDI CKS FLYSAAKFDTQGKAFVKES LKCIANGVTSKVFLAI RRCST FQRM 120 
IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I M I I I I I I I I I I I I i I I I I I I I I I 

IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

| \ \ | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 24 0 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 



Qy 241 RTSHESA 247 

I I II I I I 

Db 241 RTSHESA 247 



RESULT 5 

US-10-418-226-10 

Sequence 10, Application US/10418226 
Publication No. US20030181663A1 
GENERAL INFORMATION: 
APPLICANT: Human Genome Sciences, Inc. 
TITLE OF INVENTION: Human Stanniocalcin-Alpha 
FILE REFERENCE: PF143P1D2 

CURRENT APPLICATION NUMBER: US/ 10/418 , 226 
CURRENT FILING DATE: 2003-04-18 
PRIOR APPLICATION NUMBER: 09/361,736 
PRIOR FILING DATE: 1999-07-28 
PRIOR APPLICATION NUMBER: 08/460,529 
PRIOR FILING DATE: 1995-06-02 
PRIOR APPLICATION NUMBER: PCT/ US94/13206 
PRIOR FILING DATE: 1994-11-30 
NUMBER OF SEQ ID NOS : 12 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 10 
LENGTH: 247 
TYPE: PRT 
ORGANISM: human 
US-10-418-226-10 

Query Match 100.0%; Score 1268; DB 14; Length 247; 

Best Local Similarity 100.0%; Pred. No. 5.5e-125; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0;< 

MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 
| J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MLQN S AVLLVLVI S ASAT HEAEQN D S VS P RKS RVAAQN S AEWRCLN S ALQVGCGAFAC L 60 

ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 

ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 
IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M Ml II I I M 

IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 
SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 24 0 

I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M I 

SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 



I I I I I I 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 



RESULT 6 

US-10-465-572-18 



; Sequence 18, Application US/10465572 

; Publication No. US20030207840A1 

; GENERAL INFORMATION: 

; APPLICANT: Riggins, Gregory 

; APPLICANT: Lai, Anita 

; TITLE OF INVENTION: GENES INDUCED BY HYPOXIA 

; FILE REFERENCE: 000250.00012 

; CURRENT APPLICATION NUMBER: US/10/465,572 

; CURRENT FILING DATE: 2003-06-20 

; PRIOR APPLICATION NUMBER: US/10/201,642 

; PRIOR FILING DATE: 2002-07-24 

; PRIOR APPLICATION NUMBER: 60/307,600 

; PRIOR FILING DATE: 2001-07-26 

; NUMBER OF SEQ ID NOS : 30 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 18 ' 

LENGTH: 247 
; TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-465-572-18 



Query Match 100.0%; Score 1268; DB 15; Length 247; 

Best Local Similarity 100.0%; Pred. No. 5.5e-125; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MLQN S AVLLVLVI S AS ATHEAEQND S VS P RKS RVAAQN S AE WRCLN S ALQVGCGAFAC L 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 12 0 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 12 0 



Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

III I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I II I I I 

Db 241 RTSHESA 247 



RESULT 7 

US-10-372-683-41 

; Sequence 41, Application US/10372683 

; Publication No. US20040009171A1 

; GENERAL INFORMATION: 

; APPLICANT: GERRITSEN, MARY E. 

; APPLICANT: PEALE JR., FRANKLIN V. 

; APPLICANT: WU, THOMAS D. 

; TITLE OF INVENTION: METHODS FOR THE TREATMENT OF CARCINOMA 
; FILE REFERENCE: P1928R1P1 

; CURRENT APPLICATION NUMBER: US/10/372 , 683 



CURRENT FILING DATE: 2003-02-21 
PRIOR APPLICATION NUMBER: US 10/271,690 
PRIOR FILING DATE: 2002-10-16 
PRIOR APPLICATION NUMBER: US 60/344,534 
PRIOR FILING DATE: 2001-10-18 
NUMBER OF SEQ ID NOS : 49 
SEQ ID NO 41 
LENGTH: 24 7 
TYPE: PRT 

ORGANISM: Homo sapien 
US-10-372-683-41 

Query Match 100.0%; Score 1268; DB 15; Length 247; 

Best Local Similarity 100.0%; Pred. No. 5.5e-125; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MLQNSAVLLVLVI SASATHEAEQNDSVS PRKSRVAAQNSAEVVRCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 1 I I I 

Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 24 0 

Qy 241 RTSHESA 247 

I I I I I I I 

Db 241 RTSHESA 2 47 



RESULT 8 

US-09-925-300-1426 

; Sequence 1426, Application US/09925300 

; Patent No. US20020151681A1 

; GENERAL INFORMATION: 

; APPLICANT: Craig Rosen, 

; APPLICANT: Steve Ruben 

; TITLE OF INVENTION: Nucleic Acids, Proteins and Antibodies 
; FILE REFERENCE: PA101 

; CURRENT APPLICATION NUMBER: US/09/925 , 300 

; CURRENT FILING DATE: 2001-08-10 

; PRIOR APPLICATION NUMBER: PCT/US00/05988 

; ■ PRIOR FILING DATE: 2000-03-08 

; PRIOR APPLICATION NUMBER: 60/124,270 

; PRIOR FILING DATE: 1999-03-12 

; NUMBER OF SEQ ID NOS: 1890 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 1426 

LENGTH: 276 

TYPE: PRT 



ORGANISM: Homo sapiens 
FEATURE : 
NAME/ KEY : SITE 
LOCATION: (43) 

OTHER INFORMATION: Xaa equals . any of the naturally occurring L-amino acids 
NAME/KEY: SITE 
LOCATION: (273) 

OTHER INFORMATION: Xaa equals any of the naturally occurring L-amino acids 
NAME/KEY: SITE 
LOCATION: (275) 

OTHER INFORMATION: Xaa equals any of the naturally occurring L-amino acids 
US-09-925-300-1426 



Query Match 73.9%; Score 937; DB 9; Length 276; 

Best Local Similarity 99.5%; Pred. No. 4.8e-90; 

Matches 185; Conservative 0; Mismatches 1; Indels 0; 



Gaps 



0; 



QY 
Db 



1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 N 1 1 1 1 1 1 

89 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 148 



Qy 

Db 

Qy 

Db 



61 ENSTCDTDGMYDICKSFLYSAAXFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I I I I I I I I I I I I I I I I 
149 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 208 

121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I I M I II 

209 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 268 



QY 
Db 



181 SLMEKI 186 
I I II I 

269 SLMEXI 274 



RESULT 9 

US-09-840-989A-3 

; Sequence 3, Application US/09840989A 

; Patent No. US20020042372A1 

; GENERAL INFORMATION: 

; APPLICANT: 01 sen et al . 

TITLE OF INVENTION: Stanniocalcin Polynucleotides, Polypeptides , and Methods 
Based Thereon 

; FILE REFERENCE: PF108P2 

; CURRENT APPLICATION NUMBER: US/09/840, 989A 
; CURRENT FILING DATE: 2001-04-25 
; PRIOR APPLICATION NUMBER: PCT/US00/29432 
; PRIOR FILING DATE: 2000-10-26 
; PRIOR APPLICATION NUMBER: US 60/161,740 
' ; PRIOR FILING DATE: 1999-10-27 
; NUMBER OF SEQ ID NOS: 12 
; SOFTWARE: Patentln version 3.1 
; SEQ ID NO 3 

LENGTH: 256 
; TYPE: PRT 

; ORGANISM: Oncorhynchus kisutch 
US-09-840-989A-3 



Query Match 52.2%; Score 662.5; DB 9; 

Best Local Similarity 53.4%; Pred. No. 3.6e-61; 
Matches 125; Conservative 46; Mismatches 54; 



Length 256; 
Indels 9; Gaps 



4; 



Qy 11 LVI S AS AT H EAEQN D S VS P RKS RVAAQN S AEWRC LN S ALQVGC GAFACLEN S T CDT D GM 70 

||: : I I : : :: I I I :: I ::::: I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 12 L VL GT AAT FDT D P E EA- S P RRAR FS S N S P S D VARC LN G ALAVGC GT FAC LENSTCDTD GM 7 0 

Qy 71 YDICKSFLYSAAKFDTQGKAFVKESLKCIANGWSKVFLAIRRCSTFQRMIAEVQEECYS 130 

: I I I : I :: I I I : I I I I I I I I I I : I I I I I I I I I I I Mil I I I I I : I I I I I I I I 
Db 71 HDICQLFFHTAATFNTQGKTFVKESLRCIANGVTSKVFQTIRRCGVFQRMISEVQEECYS 130 

Qy 131 KLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNM 190 

: I : : I : I : I I I I I I I I I : I II INI: I : : I I I I M : M : : I I : : : I I : I 

Db 131 RLDICGVARSNPEAIGEWQVPAHFPNRYYSTLLQSLLACDEETVAWRAGLV7VRLGPDM 190 

Qy 191 ASLFHILQTDHCAQTHPRADFN RRRTNEPQKLKVLLRNLRGEEDSPSHI 239 

: II : M II I : : I I I : :: I I : I : I : 

Db 191 ETLFQLLQNKHCPQGSNQGPNSAPAGWRWPMGSPPSFKI-QPSMRGRD— PTHL 241 



RESULT 10 
US-10-116-051-10 

Sequence 10, Application US/10116051 
Publication No. US20020146791A1 
GENERAL INFORMATION: 
APPLICANT: Olsen et al . 

TITLE OF INVENTION: CORPUSCLES OF STANNIUS PROTEIN, STANNIOCALCIN 
FILE REFERENCE: PF108P1D1C1 

CURRENT APPLICATION NUMBER: US/ 10/ 116, 051 
CURRENT FILING DATE: 2002-04-05 
PRIOR APPLICATION NUMBER: 09/312,610 
PRIOR FILING DATE: 1999-05-17 
PRIOR APPLICATION NUMBER: 08/431,117 
PRIOR FILING DATE: 1995-04-28 
PRIOR APPLICATION NUMBER: 08/208,005 
PRIOR FILING DATE: 1994-03-08 
NUMBER OF SEQ ID NOS : 10 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 10 
LENGTH: 204 
TYPE: PRT 

ORGANISM: Oncorhynchus kisutch 
US-10-116-051-10 

Query Match 52.1%; Score 660.5; DB 13; Length 204; 

Best Local Similarity 60.8%; Pred. No. 4.3e-61; 

Matches 118; Conservative 38; Mismatches 37; Indels 1; Gaps 1; 

Qy 11 LVISASATHEAEQNDSVSPRKSRVTiAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGM 70 

||: : | I : : : : I I I : : I : : : : : I MM II II II I II I II M II I II I 

Db 12 LVLGTAATFDTDPEEA-SPRRARFSSNSPSDVARCLNGALAVGCGTFACLENSTCDTDGM 70 

Qy 71 YDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRMIAEVQEECYS 130 

: M I: I : : I I I : M M M M M : M I M M M I I MM II II I : II M I I II 
Db 71 HDICQLFFHTAATFNTQGKT FVKESLRCIANGVTSKVFQTIRRCGVFQRMISEVQEECYS 130 



131 



KLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNM 190 



Db 



131 




191 



ASLFHILQTDHCAQ 204 



Db 



191 



ETLFQLLQNKHCPQ 2 04 



RESULT 11 
US-09-361-736-9 

; Sequence 9, Application US/09361736 
; Patent No. US20020102634A1 
GENERAL INFORMATION: 

APPLICANT: OLSEN, ET AL. 

TITLE OF INVENTION: Human Stanniocalcin-alpha 
NUMBER OF SEQUENCES: 10 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: CARELLA, BYRNE, BAIN, GILFILLAN, 
; ADDRESSEE: CECCHI, STEWART & OLSTEIN 

STREET: 6 BECKER FARM ROAD 
CITY: ROSELAND. 
STATE: NEW JERSEY 
COUNTRY: USA 
ZIP: 07068 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: 3.5 INCH DISKETTE 

; COMPUTER: IBM PS/2 

OPERATING SYSTEM: MS-DOS 
SOFTWARE: WORD PERFECT 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 09/361 , 736 

FILING DATE: ' . 

; CLASSIFICATION: 

•PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/460,529 
FILING DATE: 
ATTORNEY/AGENT INFORMATION : 
NAME: MULLINS, J.G. 
; REGISTRATION NUMBER: 33,073 

; REFERENCE/ DOCKET NUMBER: 325800-334 (PF143) 

; . TELECOMMUNICATION INFORMATION: 

TELEPHONE: 201-994-1700 
; TELEFAX: 201-994-1744 

; INFORMATION FOR SEQ ID NO: 9: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 170 AMINO ACIDS 

TYPE: AMINO ACID 

STRANDEDNESS: 

TOPOLOGY: LINEAR 
MOLECULE TYPE: PROTEIN 
US-09-361-736-9 

Query Match 49.5%; Score 628; DB 9; Length 170; 

Best Local Similarity 66.5%; Pred. No. 8.8e~58; 

Matches 113; Conservative 30; Mismatches 27; Indels 0; Gaps 0 



Qy 28 SPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKFDTQ 87 

III : I : I : : : I I I I I I I II I I II I I I : II I I : I I I I : : I I : I I I : I I I I I I I 

Db 1 S PRTARFSAS S P S DVARCLNGALQVGC SAFACLDNSTCNTDGMHEI CRS FLHGAAKFDTQ 60 

Qy 88 GKAFVKESLKCIANGVTSKVFLAIRRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITE 147 

. II I I I I I I I I I I I I : I I I I I I I I I I I : I I : I I : I I I I I I I I I I :: I I : I : 1111= I 

Db 61 GKTFVKESLKCIANGITSKVFLTIRRCSSFQKMISEVQEECYSKLDLCSVAQSNPEAMGE 120 

Qy 148 WQLPNHFSNRYYNRLVRS LLECDEDTVSTI RDSLMEKI GPNMAS LFHI L 197 

I I : I : I I I I I : I I I I I M I I I : I I : : : I I I I : I 
Db 121 VAQVPSQFPNRYYSTLLQSLLTCDEDTVEQVRAGLVSRLEPEMGVLFQLL 170 



RESULT 12 
US-10-418-226-9 

Sequence 9, Application US/10418226 
Publication No. US2003018 1663A1 
GENERAL INFORMATION: 
APPLICANT: Human Genome Sciences , Inc. 
TITLE OF INVENTION: Human Stanniocalcin-Alpha 
FILE REFERENCE: PF143P1D2 

CURRENT APPLICATION NUMBER: US/ 10/4 18 , 226 
CURRENT FILING DATE: 2003-04-18 
PRIOR APPLICATION NUMBER: 09/361,736 
PRIOR FILING DATE: 1999-07-28 
PRIOR APPLICATION NUMBER: 08/460,529 
PRIOR FILING DATE: 1995-06-02 
PRIOR APPLICATION NUMBER: PCT/ US94/13206 
PRIOR FILING DATE: 1994-11-30 
NUMBER OF SEQ ID NOS : 12 
SOFTWARE: Patent In version 3.1 
SEQ ID NO 9 . 
LENGTH: 170 
TYPE: PRT 

ORGANISM: Anguilla australis 
US-10-418-226-9 

Query Match 49.5%; Score 628; DB 14; Length 170; 

Best Local Similarity 66.5%; Pred. No. 8.8e-58; 

Matches 113; Conservative 30; Mismatches 27; Indels 0; Gaps 0; 

Qy 28 SPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKFDTQ 87 

I I I : I : I ::: I II I I I I I I I I I I I I I : I I I I : 1 II I :: I I : I I I : I I I I I I I 

Db 1 S PRTARFSAS S PS DVARCLNGALQVGCSAFACLDNSTCNTDGMHEICRSFLHGAAKFDTQ 60 

Qy 88 GKAFVKESLKCIANGVTSKVFLAI RRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITE 147 

II I I I I M I I I II I : I I II I I I I I I I : I I : I I : I I I I I I I I I I :: I I : I : MM: I 

Db 61 GKTFVKESLKCIANGITSKVFLTIRRCSSFQKMISEVQEECYSKLDLCSVAQSNPEAMGE 120 

Qy 14 8 WQLPNHFSNRYYN RLVRS LLECDEDTVSTI RDSLMEKI GPNMASLFHIL 197 

I I : I : I I I I I : I :: I I I I I I I I I : I I : : : I I I I : I 
Db 121 VAQVPSQFPNRYYSTLLQSLLTCDEDTVEQVRAGLVSRLEPEMGVLFQLL 170 



RESULT 13 

US-09-864-761-37770 

; Sequence 37770, Application US/09864761 



; Patent No. US20020048763A1 
; GENERAL INFORMATION: 
; APPLICANT: Penn, Sharron G. 
; APPLICANT: Rank, David R. 
; APPLICANT: Hanzel, David K. 
; APPLICANT: Chen, Wensheng 

; TITLE OF INVENTION: HUMAN GENOME-DERIVED SINGLE EXON NUCLEIC ACID PROBES 
USEFUL FOR 

; TITLE OF INVENTION: GENE EXPRESSION ANALYSIS BY MICROARRAY 

; FILE REFERENCE: Aeomica-X-1 

; CURRENT APPLICATION NUMBER: US/09/864 , 761 

; CURRENT FILING DATE: 2001-05-23 

; PRIOR APPLICATION NUMBER: US 60/180,312 

; PRIOR FILING DATE: 2000-02-04 

PRIOR APPLICATION NUMBER: US 60/207,456 
; PRIOR FILING DATE: 2000-0-5-26 
; PRIOR APPLICATION NUMBER: US 09/632,366 
; PRIOR FILING DATE: 2000-08-03 
; PRIOR APPLICATION NUMBER: GB 242 63.6 
; PRIOR FILING DATE: 2000-10-04 
; PRIOR APPLICATION NUMBER: US 60/236,359 

PRIOR FILING DATE: 2000-09-27 

PRIOR APPLICATION NUMBER: PCT/US01/00666 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00667 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00664 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00669 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00665 
; PRIOR FILING DATE: 2001-01-30 

PRIOR APPLICATION NUMBER: PCT/US01/00668 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/ 00663 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00662 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00661 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00670 
; PRIOR FILING DATE: 2001-01-30 

PRIOR APPLICATION NUMBER: US 60/234,687 
; PRIOR FILING DATE: 2000-09-21 
; PRIOR APPLICATION NUMBER: US 09/608,408 
; PRIOR FILING DATE: 2000-06-30 
; PRIOR APPLICATION NUMBER: US 09/774,2 03 
; PRIOR FILING DATE: 2001-01-29 
; NUMBER OF SEQ ID NOS : 4 9117 

; SOFTWARE: Annomax Sequence Listing Engine vers. 1.1 
; SEQ ID NO 37770 

LENGTH: 70 

TYPE : PRT 
; ORGANISM: Homo sapiens 

FEATURE: 

OTHER INFORMATION: MAP TO AC012119.1 

OTHER INFORMATION: EXPRESSED IN HELA, SIGNAL =1.7 



OTHER INFORMATION: EXPRESSED IN HBL100, SIGNAL = 2.4 

OTHER INFORMATION: EXPRESSED IN HEART, SIGNAL = 2.8 

OTHER INFORMATION: EXPRESSED IN FETAL LIVER, SIGNAL =2.8 

OTHER INFORMATION: EXPRESSED IN LUNG, SIGNAL =2.4 

OTHER INFORMATION: EXPRESSED IN PLACENTA, SIGNAL =3.1 

OTHER INFORMATION: EXPRESSED IN BT474, SIGNAL =1.8 

OTHER INFORMATION: EXPRESSED IN ADULT LIVER, SIGNAL =1.7 

OTHER INFORMATION: EXPRESSED IN BRAIN, SIGNAL =2.2 

OTHER INFORMATION: EXPRESSED IN BONE MARROW, SIGNAL =2.7 

OTHER INFORMATION: SWISSPROT HIT: P52823, EVALUE 3.00e-35 

OTHER INFORMATION: EST_HUMAN HIT: AW954342.1, EVALUE 3.00e-34 
US-09-864-761-37770 



Query Match 28.2%; Score 358; DB 9; Length 70; 

Best Local Similarity 100.0%; Pred. No. 6.9e-30; 

Matches 70; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 8 GKAFVKESLKCIANGVTSKVFLAIRRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITE 147 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 GKAFVKESLKCIANGVTSKVFLAI RRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITE 60 



Qy 148 WQLPNHFSN 157 

I I I I I I I I I I 
Db 61 WQLPNHFSN 7 0 



RESULT 14 
US-10-058-270A-22 

; Sequence 22, Application US/10058270A 

; Publication No. US2004002 9114A1 

; GENERAL INFORMATION: 

; APPLICANT: Mack, David H . 

; APPLICANT: Gish, Kurt C. 

; APPLICANT: Afar, Daniel 

; APPLICANT: Eos Biotechnology, Inc. 

; TITLE OF INVENTION: Methods of Diagnosis of Breast Cancer, Compositions and 
TITLE OF INVENTION: Methods of Screening for Modulators of Breast Cancer 

; FILE REFERENCE: 018501-005210US 

; CURRENT APPLICATION NUMBER: US/10/058 , 270A 

; CURRENT FILING DATE: 2002-01-24 

; PRIOR APPLICATION NUMBER: US 60/263,965 

; PRIOR FILING DATE: 2001-01-24 

PRIOR APPLICATION NUMBER: US 60/265,928 

; PRIOR FILING DATE: 2001-02-02 

; PRIOR APPLICATION NUMBER: US 09/829,472 

; PRIOR FILING DATE: 2001-04-09 

; PRIOR APPLICATION NUMBER: US 60/282,698 

; PRIOR FILING DATE: 2001-04-09 

; PRIOR APPLICATION NUMBER: US 60/288,590 

; PRIOR FILING DATE: 2001-05-04 

; PRIOR APPLICATION NUMBER: US 60/294,443 

; PRIOR FILING DATE: 2001-05-29 

; NUMBER OF SEQ ID NOS : 141 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 22 
LENGTH: 302 

; TYPE: PRT 



; ORGANISM: Homo sapiens 
US-10-058-270A-22 



Query Match . 27.9%; Score 354; DB 12; Length 302; 

Best Local Similarity 32.4%; Pred. No. 1.5e-28; 

Matches 83; Conservative 45; Mismatches 102; Indels 26; Gaps 4 

LVLVIS AS AT H EAE - QN D S VS P RK S RVAAQN S AE WRC LN S ALQVGC GAFAC L 60 

Ml:: II: I I I : I I :: I I : I I : I I : I I I I I I I 

LALVLATFDPARGTDATNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECF 71 

ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I | : : I : I : : I I : I I : : I I I I I I I : I : I : : I I I I : : : hi : I 

ENNSCEIRGLHGICMTFLHNAGKFDAQGKSFIKDALKCKAHALRHRFGCISRKCPAI REM 131 



: I | | | | : : | : I : I I I : : : : I II III 



Qy 


9 


Db 


12 


Qy 


61 


Db 


72 


Qy 


121 


Db 


132 


Qy 


181 


Db 


• 192 


Qy 


223 


Db 


252 



222 



I II II III I : : I I 



I : I I III 



RESULT 15 
US-10-364-889-4 

Sequence 4, Application US/10364889 
Publication No. US20030224989A1 
GENERAL INFORMATION: 
APPLICANT: Panel, Gregory L. 
APPLICANT: Quinn, Kerry 

TITLE OF INVENTION: Compositions and Methods for Treatment of Osteoarthriti 
FILE REFERENCE: 21402-558 

CURRENT APPLICATION NUMBER: US/10/364,889 
CURRENT FILING DATE: 2003-02-12 
PRIOR APPLICATION NUMBER: 60/356,376 
PRIOR FILING DATE: 2002-02-12 
NUMBER OF SEQ ID NOS : 8 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 4 
LENGTH: 302 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-364-889-4 

Query Match 27.9%; Score 354; DB 12; Length 302; 

Best Local Similarity 32.4%; Pred. No. 1.5e-28; 

Matches 83; Conservative 45; Mismatches 102; Indels 26; Gaps 4 

Qy 9 LVLVIS AS AT H EAE - QNDSVSPRKS RVAAQN S AE WRC LN S AL Q VG C GAFAC L 60 

Ml:: II: I I | : I I : : I I : I I : I I : I I I I I I I 

Db 12 LALVLATFDPARGTDATNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECF 71 



Qy 61 ENSTCDTDGMYDICKSFLYSA7VKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

| | : : I : | : : I I : I I : : I I I I I I I : I : I : : I I I I : : : I : I : I 

Db 72 ENNSCEIRGLHGICMTFLHNAGKFDAQGKSFIKDALKCKAHALRHRFGCISRKCPAI REM 131 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

: : : : I M I I : : I : I : I I I : : : : I II MM: I 
Db 132 VSQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHEPYVDLVNLLLTCGEEVKEAITH 191 

Qy 181 SLMEKIGPNMASLFHILQ-TDHCAQTHPRADFNRRRTNEPQKL 222 

I :: I I I I I I I I I :: I I 

Db 192 SVQVQCEQNWGSLCSILSFCTSAIQKPPTAPPERQPQVDRTKLSRAHHGEAGHHLPEPSS 251 

Qy 223 KVLLRNLRGEEDSPSH 238 

: I Ml III 

Db 252 RETGRGAKGERGSKSH 2 67 

Search completed: July 19, 2004, 15:39:03 
Job time : 47 sees 



* 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



July 19, 2004, 15:29:43 



Search time 39 Seconds 
(without alignments) 
1998.282 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-614-990-2 
1268 

1 MLQNSAVLLVLVI SASATHE NLRGEEDSPSHIKRTSH.ESA 247 



Scoring table: 



Searched: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1017041 seqs, 315518202 residues 



Total number of hits satisfying chosen parameters: 



1017041 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



SPTREMBL 25:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 



sp_archea : * 
sp_bacteria : * 
sp_f ungi : * 
sp__human : * 
sp_invertebrate : * 
sp_mammal : * 
sp_mhc : * 
sp_organelle: * 
sp_phage : * 

sp__plant : * 

sp_rodent : * 

sp_virus : * 

sp_vertebrate : * 

sp_unclassif ied: * 

sp_r virus : * 

sp_bacteriap: * 

sp_ar cheap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB ID 



Description 



1 


1199 


94. 


6 


247 


6 


Q9N0T1 


Q9n0tl bos taurus 


2 


1192. 5 


94. 


0 


246 


11 


Q7TSN9 


Q7tsn9 mus musculu 


3 


779.5 


61. 


5 


252 


13 


Q800I4 


Q800i4 lepisosteus 


4 


778.5 


61. 


4 


252 


13 


Q800I5 


Q800i5 amia calva 


5 


654 


51. 


6 


249 


13 


Q98TB7 


Q98tb7 osteoglossu 


6 


354.5 


28 . 


0 


296 


11 


Q9DCS6 


Q9dcs6 mus musculu 


7 


102 


8. 


0 


473 


10 


Q9SZ16 


Q9szl6 arabidopsis 


8 


92 


7. 


3 


1108 


11 


Q91YR4 


Q91yr4 mus musculu 


9 


' 91. 5 


7. 


2 


1068 


5 


Q8IHR6 


Q8ihr6 plasmodium 


10 


90. 5 


7. 


1 


454 


5 


Q9GV74 


Q9gv74 dictyosteli 


11 


90. 5 


7 . 


1 


783 


3 


014130 


014130 schizosacch 


12 


89.5 


7. 


1 


622 


10 


Q8H2Y0 


Q8h2y0 oryza sativ 


13 


89.5 


7. 


1 


1126 


10 


064605 


064605 arabidopsis 


14 


89.5 


7 . 


1 


3658 


10 


Q9M7K6 


Q9m7k6 arabidopsis 


15 


89.5 


7 . 


1 


9439 


16 


Q8CP76 


Q8cp76 staphylococ 


16 


89 


7 . 


0 


305 


10 


065029 


065029 linum usita 


17 


89 


7 . 


0 


996 


13 


Q7ZW18 


Q7zwl8 brachydanio 


18 


89 


7 . 


0 


1728 


10 


Q9LUI2 


Q91ui2 arabidopsis 


19 


88 


6. 


9 


473 


10 


Q9LFC2 


Q91fc2 arabidopsis 


20 


87 


6. 


9 


578 


10 


Q8LEG3 


Q81eg3 arabidopsis 


21 


86.5 


6. 


8 


290 


10 


Q8LSZ6 


Q81sz6 hevea brasi 


22 


86 


6 . 


8 


470 


16 


Q9Z8Q2 


Q9z8q2 chlamydia p 


23 


86 


6. 


8 


510 


10 


Q9LFC4 


Q91fc4 arabidopsis 


24 


86 


6. 


8 


556 


3 


Q03899. 


Q03899 saccharomyc 


25 


86 


6. 


8 


1091 


6 


Q7YRF1 


Q7yrfl canis famil 


26 


85. 5 


6. 


7 


183 


10 


Q8L5G9 


Q815g9 hevea brasi 


27 


85.5 


6. 


7 


290 


10 


Q8L5B7 


Q815b7 hevea brasi 


28 


85. 5 


6. 


7 


290 


10 


Q948Q0 


Q948q0 hevea brasi 


29 


85 . 5 


6. 


7 


447 


3 


Q7Z9L8 


Q7z918 coprinus ci 


30 


85 . 5 


6. 


7 


634 


5 


Q95PL8 


Q95pl8 trypanosoma 


31 


85 . 5 


6. 


7 


654 


16 


Q8CN27 


Q8cn27 staphylococ 


32 


85 


6. 


7 


806 


10 


Q9FFZ5 


Q9ffz5 arabidopsis 


33 


83.5 


6. 


6 


290 


10 


Q8L3V1 


Q813vl hevea brasi 


34 


83 . 5 


6. 


6 


290 


10 


Q8L5H1 


Q815hl hevea brasi 


35 


83.5 


6. 


6 


427. 


5 


Q9N3E2 


Q9n3e2 caenorhabdi 


36 


83.5 


6. 


6 


1026 


3 


074669 


074669 pneumocysti. 


37 


83. 5 


6. 


6 


1028 


3 


074668 


074 668 pneumocysti 


38 


83.5 


6. 


6 


1273 


5 


Q9BL02 


Q9bl02 caenorhabdi 


39 


83 


6. 


5 


250 


17 


Q04927 


Q04 927 methanobact 


40 


83 


6. 


5 


2.60 


10 


Q8GZS0 


Q8gzs0 lupinus alb 


41 


83 


6. 


5 


297 


12 


Q91BJ2 


Q91bj2 spodoptera 


42 


83 


6. 


5 


365 


10 


Q9SU53 


Q9su53 arabidopsis 


43 


83 


6. 


5 


398 


16 


Q8CRY1 


Q8cryl staphylococ 


44 


83 


6. 


5 


848 


6 


Q9XS94 


Q9xs94 bos taurus 


45 


83 


6. 


5 


1431 


16 


Q891P4 


Q891p4 Clostridium 



ALIGNMENTS 



RESULT 1 
Q9N0T1 

ID Q9N0T1 PRELIMINARY; PRT; 247 AA. 

AC Q9N0T1; 

DT 01-OCT-2000 (TrEMBLrel. 15, Created) 

DT ' 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 



DE Stanniocalcin . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla ; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RA DiMattia G.E. ; 

RT "bovine stanniocalcin cDNA sequence."; 

RL Submitted (APR-2000) to the EMBL/'GenBank/DDBJ databases. 

DR EMBL; AF257506; AAF68996.1; -. 

DR GO; GO: 0005576; C : extracellular ; IEA. 

DR GO; GO: 0005524; F: ATP binding; IEA. 

DR GO; GO: 0005179; F:hormone activity; IEA. 

DR GO; GO: 0004672; F:protein kinase activity; IEA. 

DR GO; GO: 0006468; P: protein amino acid phosphorylation; IEA. 

DR InterPro; IPR000719; Proteinase. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

DR PROSITE; PS00107; P ROT E I N_K I NAS E_AT P ; 1. 

SQ SEQUENCE 247 AA; 27555 MW; F9942A7 15E2A3DE0 CRC64; 

Query Match 94.6%; Score 1199; DB 6; Length 247; 

Best Local Similarity 94.7%; Pred. No. 1.5e-105; 

Matches 234; Conservative 5; Mismatches 8; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I II I I I I I I I I I I I 
Db 1 MLQNSAVI LVLVISASATHEAEQNDSVSLRKSRVAAQNSAEVT RCLNSALQVGCGAFACL 60 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 12 0 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 61 ENSTCDTDGMYDICKSFLYSA7^FDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I I I I I I I I I : I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I 
Db 121 IAEVQEECYTKLNVCSVAKRNPEAITEWQLPNHFSNRYYNRLVRSLLDCDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 i I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

Db 181 SLMEKIGPNMASLFHILQTDHCAHTQQRADFNRRRANEPQKLKVLLRNLRGEVASPSHIK 240 



Qy 241 RTSHESA 247 

III III 
Db 241 RTSQESA 247 



RESULT 2 
Q7TSN9 

ID Q7TSN9 PRELIMINARY 
AC Q7TSN9; 

DT 01-OCT-2003 (TrEMBLrel. 
DT 01-OCT-2003 (TrEMBLrel. 
DT 01-OCT-2003 (TrEMBLrel. 
DE Stanniocalcin 1. 



PRT; 246 AA. 
25, Created) 

25, Last sequence update) 
25, Last annotation update) 



OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP ' SEQUENCE FROM N.A. 

RC STRAIN=129/SvJ; 

RA Chang A. C, Cha J. , Reddel R.R.; 

RT "The murine stanniocalcin 1 gene is not essential for growth and 

RT development . " ; 

RL Submitted (MAY-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AF512563; AAP47156.1; -. 

SQ SEQUENCE 246 AA; 27423 MW; 4101C37EB6D743B2 CRC64; 

Query Match 94.0%; Score 1192.5; DB 11; Length 246; 

Best Local Similarity 93.9%; Pred. No. 6.3e-105; 

Matches 232; Conservative 7; Mismatches 7; Indels 1; Gaps 1; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I I I I I I I : I 11111:1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : : I I I I I 
Db 1 MLQNSAVI LALVI SAAAAHEAEQNDS VS PRKS RVAAQNS AEWRCLNSACRLAAG- FACL 59 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I 11 I I I I I I I I I I 

Db 60 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGITSKVFLAIRRCSTFQRM 119 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 18 0 

I I I I I I : I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 120 IAEVQEDCYSKLNVCSIAKRNPEAITEVIQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 179 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 24 0 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 180 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEGDSPSHIK 239 

«« 

Qy 241 RTSHESA 247 

III III 

Db 240 RTSQESA 246 



RESULT 3 
Q800I4 

ID Q800I4 PRELIMINARY; PRT; 252 AA. 

AC Q800I4; 

DT 01-JUN-2003 (TrEMBLrel. 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Stanniocalcin precursor. 

GN STC. 

OS Lepisosteus osseus (Long-nosed gar) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Actinopterygii ; Neopterygii; Semionotif ormes ; Lepisosteidae; 

OC Lepisosteus. 

OX NCBI_TaxID=34771; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Amemiya Y., Youson J.H.; 

RT "Primary structure of stanniocalcin in two basal Actinopterygii."; 



RL Submitted (MAR-2003) to the EMBL/GenBank/DDBJ databases, 

DR EMBL; AB106623; BAC66164.1; 

DR GO; GO: 0005576; C : extracellular ; IEA. 

DR GO; GO: 0005524; F : ATP binding; IEA. 

DR GO; GO: 0005179; F:hormone activity; IEA. 

DR GO; GO: 0004672; Frprotein kinase activity; IEA. 

DR GO; GO: 0006468; P:protein amino acid phosphorylation; IEA. 

DR InterPro; IPR000719; Prot_kinase. 

DR InterPro; IPR004978; Stanniocalcin . 

DR Pfam; PF03298; Stanniocalcin; 1.. 

DR PROSITE; PS00107; PROTEIN_KINASE_ATP; 1. 

KW Signal. 

FT SIGNAL 1 32 Potential. 

FT CHAIN 33 252 Potential. 

SQ SEQUENCE 252 AA; 27936 MW; 2C99E8 8DCCED2E33 CRC64; 



Query Match 61.5%; Score 779.5; DB 13; Length 252; 

Best Local Similarity 59.8%; Pred. No. 9.5e-66; 

Matches 152; Conservative 44; Mismatches 49; Indels 9; Gaps 3; 

Qy 1 MLQN S AVL LVL VI S AS AT H EAEQN D S VS P RKS RVAAQN S AE WRC LN S ALQVGCGAFAC L 60 

II: : : I I : I I : II : I : I I : I I I I : : I I : : : : I I I I I I I I I I I I I I I I I I 

Db 1 MLRKTGILLLLVLLTSA-YELDQNESFSPRRTRVSTHSPSDVARCLNSALQVGCGAFACL 59 

Qy 61 EN STCDT DGMYDI CKS FLYSAAKFDTQGKAFVKES LKCI ANGVT S KVFLAI RRCST FQRM 120 

I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I MINIUM 
Db 60 ENSTCDTDGMHDICKSFLYSAAKFDTQGKVFVKESLKCIANGITSKVFLTIRRCSTFQRM 119 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I : I II I I I I I I I : : I Ml I I M I M I M I M I I I I I : M : M I M I I : M I I M 
Db 120 ISEVQEECYSKLDICGVAKLNPDAISEVAQLPSHFPNRYYSKLLQSLMECDDETVSLVRS 179 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRAD FNRRRTNEPQKLKVLLRNLRGEE 233 

I I : : M I I I I I M I : I : : II I I : I I I I 

Db 180 SLVSRLGPEMAMLFQLLQSKPCPSSSSLSSPAGAEGRGNWRWPIGPHVFK-MQPNLRREP 238 

Qy 234 DSPSHIKRTSHESA 247 

: II:: M : 

Db 239 STLFSKKRSADDSS 252 



RESULT 4 
Q800I5 



ID Q800I5 PRELIMINARY; PRT; 252 AA. 

AC Q800I5; 

DT 01-JUN-2003 (TrEMBLrel . 24, Created) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Stanniocalcin precursor. 

GN STC . 

OS Amia calva (Bowfin) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Amiiformes; Amiidae; Amia. 

OX N,CBI_TaxID=7924; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Amemiya Y., Youson J.H.; 



RT "Primary structure of stanniocalcin in two basal Actinopterygii . " ; 

RL Submitted (MAR-2 003) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AB106622; BAC66163.1; 

DR GO; GO: 0005576; C : extracellular ; IEA. 

DR GO; GO: 0005524; F: ATP binding; IEA. 

DR GO; GO: 0005179; F: hormone activity; IEA. 

DR GO; GO: 0004672; F:protein kinase activity; IEA. 

DR GO; GO: 0006468; P:protein amino acid phosphorylation; IEA. 

DR InterPro; IPR000719; Proteinase. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

DR PROSITE; PS00107; P ROT E I N_K I N AS E__AT P ; 1. 

KW Signal. 

FT SIGNAL 1 32 Potential. 

FT CHAIN 33 252 Potential. 

SQ SEQUENCE 252 AA; 27868 MW; B9D9663610DF0B5D CRC64; 



Query Match 61.4%; Score 778.5; DB 13; Length 252; 

Best Local Similarity 69.8%; Pred. No. 1.2e-65; 

Matches 141; Conservative 37; Mismatches 23; Indels 1; Gaps 1; 

Qy 1 MLQNS AVLLVLVI S AS AT H EAEQND S VS PRKS RVAAQN S AEWRC LNS ALQVGCGAFACL 60 

I I : : I I : I I : II : I : I I = I I I I :: M : I : I :: I I I I I I I I I I I I I I I I I I 
Db 1 MLHKTGI LLLLVLLTSA- YELDQNES FS PRRTRVSAHS S SDVARCLNS ALQVGCGAFACL 59 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 12 0 

I I I II I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I 
Db 60 ENSTCDTDGMHDICKSFLYSAAKFDTQGKVFVKESLKCIANGITSKVFLTIRRCSTFQRM 119 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I : I I I I I I I : I I : : I : I : I I I I I : I I I I I : I I I I I I : : I : : I I : I I I I : I I I : I 
Db 120 ISEVQEECYTKLDICEVARLNPEAISEVAQLPSHFPNRYYSKLLQSLMECDEETVSWRS 179 

Qy 181 SLMEKIGPNMASLFHILQTDHC 202 

: I : : : I I I : I I : I I : I 
Db 180 NLVSRLGPEMSMLFQLLQSKPC 201 



RESULT 5 
Q98TB7 



ID Q98TB7 PRELIMINARY; PRT; 249 AA. 

AC Q98TB7; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Stanniocalcin precursor. 

GN STC. 

OS Osteoglossum bicirrhosum (silver arawana) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Osteoglossomorpha ; 

OC Osteoglossif ormes ; Osteoglossidae; Osteoglossum. 

OX NCBI_TaxID=109271; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Stannius corpuscules; 

RA Amemiya Y., Marra L.E., Reyhani N., Youson J.H.; 

RT "Stanniocalcin from an ancient teleost. Evidence for a monomeric form 



RT of the hormone and an extracorpuscular distribution."; 

RL Submitted (APR-2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AB060558; BAB43868.1; 

DR GO; GO: 0005576; C : extracellular ; IEA. 

DR GO; GO: 0004108; F: citrate ( Si ) -synthase activity; IEA. 

DR GO; GO: 0005179; F:hormone activity; IEA. 

DR GO; GO: 0006099; P : tricarboxylic acid cycle; IEA. 

DR InterPro; IPR002020; Citrate_synth . 

DR InterPro; IPR004978; Stanniocalcin . 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Signal. 

FT SIGNAL 1 31 POTENTIAL . 

FT CHAIN 32 249 POTENTIAL. 

SQ SEQUENCE 249 AA; 27389 MW; 102 97D054A2BEB3A CRC64; 



Query Match 51.6%; Score 654; DB 13; Length 249; 

Best Local Similarity 57.5%; Pred. No. 7.5e-54; 

Matches 122; Conservative 43; Mismatches 45; Indels 2; Gaps 1; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

I : I : I I : -III : I I : I I I : : : I I I I : I I I I I I I I I I I 

Db 1 MIQKCMLLLLFFLTASAF — VI DQ E E P S T RT ARFAAN S L S DVARC L S GALQVG C S AFAC L 58 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRM 120 

I I I I I I I I I I :: I I I : I I I : I I I I II I I I I I I I I I : I : I I I : I I I II : I I I I I I I I 
Db 59 ENSTCDTDGMHEICKTFLYTAAKFDTQGKTFVKESLRCMANGITSKGFLMVRRCSTFQSM 118 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

: I : I I I : I I : I I : : I : I : I I I I I II I I I I I I I : I : I : : I I I I I I : : I I I : I I 
Db 119 LADVQEDCYNKLDLCGVARANPEAIGEVAQLPNSFPNRHYSTLLQSLLECDQETVSLVRD 178 

Qy 181 S LME K I G PNMAS LFHILQTDH C AQT H P RAD FN 212 

I : : : I I : I I I : I I : : : : I I 
Db 179 SMSARLGPEVAML FKLLQS S S RS GSAAQASNN 210 



RESULT 6 
Q9DCS6 



ID Q9DCS6 PRELIMINARY; PRT; 296 AA. 

AC Q9DCS6; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Stanniocalcin 2 . 

GN STC2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=100 90; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Kidney; 

RX MEDLINE-21085660; PubMed=11217851 ; 

RA Kawai J., Shinagawa A., Shibata K., Yoshino M. , Itoh M. , Ishii Y. , 

RA Arakawa T . , Hara A., Fukunishi Y. , Konno H., Adachi J., Fukuda S., 

RA Aizawa K., Izawa M. , Nishi K., Kiyosawa H., Kondo S., Yamanaka I., 

RA Saito T., Okazaki Y., Gojobori T., Bono H., Kasukawa T., Saito R. , 



RA Kadota K., Matsuda H.A., Ashburner M. , Batalov S., Casavant T . , 

RA Fleischmann W., Gaasterland T . , Gissi C, King B., Kochiwa H., 

RA Kuehl P., Lewis S., Matsuo Y., Nikaido I., Pesole G., Quackenbush J., 

RA Schriml L.M., Staubli F., Suzuki R. , Tomita M. , Wagner L., Washio T., 

RA Sakai K. , Okido T., Furuno M. , Aono H., Baldarelli R. , Barsh G., 

RA Blake J., Boffelli D., Bojunga N., Carninci P., de Bonaldo M.F., 

RA Brownstein M.J., Bult C, Fletcher C, Fujita M. , Gariboldi M. , 

RA Gustincich S., Hill D . , Hofmann M. , Hume D . A. , Kamiya M. , Lee N.H., 

RA Lyons P., Marchionni L., Mashima J., Mazzarelli J., Mombaerts P., 

RA Nordone P., Ring B., Ringwald M; , Rodriguez I. f Sakamoto N., 

RA Sasaki H. , Sato K. , Schoenbach C. f Seya T., Shibata Y., Storch K.-F., 

RA Suzuki- H., Toyo-oka K. , Wang K.H., Weitz C, Whittaker C, Wilming L., 

RA Wynshaw-Boris A. , Yoshida K. , Hasegawa Y., Kawaji H., Kohtsuki S., 

RA Hayashizaki Y. ; 

RT "Functional annotation of a full-length mouse cDNA collection."; 



RL Nature 409:685-690(2001). 

DR EMBL; AK002527; BAB22164.1; -. 

DR MGD; MGI: 1316731; Stc2. 

DR GO; GO: 0005576; C : extracellular ; IEA. 

DR GO; GO: 0005179; F:hormone activity; IEA. 

DR InterPro; IPR004978; Stanniocalcin . 

DR Pfam; PF03298; Stanniocalcin; 1. 

SQ SEQUENCE 296 AA; 32488 MW; BD 8 96 1 CEFFA4 7508 CRC64; 

Query Match 28.0%; Score 354.5; DB 11; Length 296; 

Best Local Similarity 31.7%; Pred. No. 2.3e-25; 

Matches 85; Conservative 45; Mismatches 109; Indels 29; Gaps 5; 

Qy 7 VLLVLVI SASATHEAEQNDSVSP RKS RVAAQN SAEWRCLNS ALQVGCGA 56 

I I I I : | : | | : | : | | : : | | : | | : I I : I I I I I 

Db 10 VTLALVF — ATLDPAQGTDSTNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGV 67 

Qy 57 FACLENSTCPTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCI7^GVTSKVFLAIRRCST 116 

I I I I : : I : I : : I I : I I : : I I I I I I I : I : I : : I : I I : : I I : I 

Db 68 FECFENNSCEIQGLHGICMTFLHNAGKFDAQGKSFIKDALRCKAHALRHKFGCISRKCPA 127 

Qy 117 FQRMIAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVS 176 

: I : : : I I I I I : : I I I : I II":: : : I II II I II 
Db 128 I REMVFQLQRECYLKHDLCSAAQENVGVI VEMI HFKDLLLHEPYVDLVNLLLTCGEDVKE 187 

Qy 177 TIRDSLMEKIGPNMASLFHIL QTDHCAQTH- PRADFNRRRTNEPQKLKV 224 

: I : : : I I I : II I I : I : 

Db 188 AVTRSVQAQCEQSWGGLCSILSFCTSNIQRPPTAAPEHQPLADRAQLSRPHHRDTDHHLT 247 



Qy 225 LLRNLRGEEDSPSHIK RTSHESA 247 

I : I I III I I : II 

Db 248 ANRGAKGERGSKSHPNAHARGRTGGQSA 275 



RESULT 7 
Q9SZ16 

ID Q9SZ16 PRELIMINARY; PRT; 473 AA. 

AC Q9SZ16; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 
DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 
DT 01-OCT-2003 (TrEMBLrel. 25 f Last annotation update) 
DE Hypothetical protein. 



GN F10M23.100 OR AT4G26760. 

OS Arabidopsis thaliana (Mouse-ear cress) . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

OC Spermatophyta; Magnoliophyta; eudicotyledons ; core eudicots; rosids; 

OC eurosids II; Brassicales; Brassicaceae; Arabidopsis. 

OX NCBI_TaxID=3702; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Bevan M. , Lecharny A. , Chefdor F. , Krivitzky M. , Kreis M. , 

RA Hoheisel J., Mewes H.W., Mayer K.F.X., Schueller C; 

RL Submitted (FEB-1999) to the EMBL/GenBarik/ DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA EU Arabidopsis sequencing project; 

RL Submitted (MAY-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Lecharny A., Chefdor F., Krivitzky M. , Kreis M. , Mewes H.W., 

RA Lemcke K., Mayer K.F.X.; 

RL Submitted (MAR-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A.. 

RA EU Arabidopsis sequencing project; 

RL Submitted (MAR-2000) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AL035440; CAB36522.1; -. 

DR EMBL; AL161565; CAB79531.1; 

DR PIR; T04799; T04799. 

DR InterPro; IPR007145; MAP65_ASE1. 

DR Pfam; PF03999; MAP65_ASE1; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 473 AA; 53737 MW; 8CB8B8 83CF5CCCD1 CRC64 ; 



Query Match 8.0%; Score 102; DB 10; Length 473; 

Best Local Similarity 21.0%; Pred. No. 0.36; 

Matches 50; Conservative 43; Mismatches 67; Indels 78; Gaps 11; 

Qy 2 6 SVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDT DGMYDICKSF-LYS 8 0 

: I : I : I I : : I I : : : I : I : I I I I I I I I : 

Db 50 NVYKKKVELAAKSRAELLQTLSD ATVELSNLTTALGEKSYIDICDSMSLFP 100 

Qy 81 AAKFDTQGKAFVKESLKCIA NG — 102 

11:11111 II 
Db 101 LQPDKTSGT— IKEQLSAIAPALEQLWQQKEERVRAFSDVQSQIQKICEEIAGGLNNGPH 158 

Qy 103 VTSKVFLAIRRCSTFQRMIAEVQEECYSKL NVCSIAKRN-PEAITEV — 14 8 

I : I : : : I I I I : I : I : I : I : : I : : : : : I I I 

Db 159 WDETDLSLKRLDDFQRKLQELQKEKSDRLQKVLEFVSTVHDLCAVLRLDFLSTVTEVHP 218 

Qy 149 VQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHILQT 199 

II : II I I : : : I I I : : : : : : : : I : : : : I 

Db 219 SLDEANGVQTKS-ISNETLARLAKTVLTLKEDKMQRLKK— LQELATQLTDLWNLMDT 273 



RESULT 8 
Q91YR4 

ID Q91YR4 PRELIMINARY; PRT; 1108 AA. 

AC Q91YR4; 



DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Similar to adaptor-related protein complex AP-3, beta 1 subunit. 

GN AP3B1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R. ; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; BC015068; AAH15068.1; -. 

DR MGD; MGI: 1333879; Ap3bl. 

DR GO; GO: 0008237; F:metallopeptidase activity; IEA. 

DR GO; GO: 0008270; F:zinc ion binding; IEA. 

DR GO; GO: 0006508; P ^proteolysis and peptidolysis ; IEA. 

DR InterPro; IPR002553; Adaptin_N. 

DR InterPro; IPR008938; ARM. 

DR InterPro; IPR006025; Pept_M_Zn_BS . 

DR Pfam; PF01602; Adapt in_N; 1. 

DR PROSITE; PS00142; ZINC_PROTEASE; 1. 

SQ SEQUENCE 1108 AA; 123155 MW; 7B436C3E32C3E2E3 CRC64 ; 

Query Match 7.3%; Score 92; DB 11; Length 1108; 

Best Local Similarity 21.2%; Pred. No. 9; 

Matches 55; Conservative 40; Mismatches 104; Indels 60; Gaps 11; 

Qy 1 MLQN S AVLLVLVI S AS AT H EAEQN D S VS P RKS RVAAQN S AEWRCLN S ALQVGC GAFACL 60 

: I I : : I : : : I : I I : I I : I I I I : I : 

Db 307 LLQSRNAAWMAVAQLYWH 1 S P K S E - - AGVI S K S L VRL L RS N RE VQ Y I VLQN I 357 

Qy 61 ENSTCDTDGMYD-ICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFL- 109 

: : I I : : Mil': II I I : I I I 

Db 358 ATMSIERKGMFEPYLKSF-YVRSTDPTMIKTLKLEILTNLANEANISTLLREFQTYVRSQ 416 

Qy 110 AIRRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITE— WQLPNHFS 156 

I I I : I I : I I : I : I I I : : I : : I I : : 

Db 417 D KQ FAAAT I QT I GRC AT SISEVTDTCLNGL-VCLLSNRDEIWAESVWIKKLLQMQ 472 

Qy 157 NRYYNRLVRSLLE-CDEDTVSTIRDSLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRR 215 

: : : I : : I I I I I : : I I I I : I 
Db 473 PAQHGEIIRHMAKLLDSITVPVARASILWLIGEN CERVPKIA 514 



Qy " 216 TNEPQKLKVLLRNLRGEED 234 

I I : : : : I : I 
Db 515 PDVLRKMAKSFTSEDD 530 



RESULT 9 
Q8IHR6 

ID Q8IHR6 . PRELIMINARY; PRT; 1068 AA. 

AC Q8IHR6; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 



DE Coat protein, gamma subunit, putative. 

GN PF11_04 63. 

OS Plasmodium falciparum (isolate 3D7) . 

OC Eukaryota; Alveolata; Apicomplexa; Haemosporida; Plasmodium. 

OX NCBI_TaxID=36329; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=3D7; 

RX MEDLINE-22255705; PubMed=12368864 ; 

RA Gardner M.J., Hall N., Fung E., White 0., Berriman M. , Hyman R.W., 

RA Carlton J.M., Pain A. , Nelson K.E., Bowman S., Paulsen I.T., James K., 

RA Eisen J. A. , Rutherford K., Salzberg S.L., Craig A., Kyes S., 

RA Chan M.-S., Nene V., Shallom S.J., Suh B. , Peterson J., Angiuoli S., 

RA Pertea M. , Allen J., Selengut J., Haft D., Mather M.W., Vaidya A.B., 

RA Martin D.M.A., Fairlamb A.H. f Fraunholz M.J., Roos D.S., Ralph S.A., 

RA McFadden G.I., Cummings L.M., Subramanian G.M. , Mungall C, 

RA Venter J.C., Carucci D.J., Hoffman Newbold C, Davis R.W., 

RA Fraser C.M. f Barrell B.; 

RT "Genome sequence of the human malaria parasite Plasmodium 

RT falciparum."; 

RL Nature 419:498-511(2002). 

DR EMBL; AE014843; AAN36043.1; 

DR InterPro; IPR002553; Adaptin_N. 

DR InterPro; IPR008938; ARM. 

DR Pfam; PF01602; Adaptin_N; 1. 

SQ SEQUENCE 1068 AA; 124368 MW; 0710F427B066B12F CRC64 ; 

Query Match 7.2%; Score 91.5; DB 5; Length 1068; 

Best Local Similarity 22.3%; Pred. No. 9.6; 

Matches '45; Conservative 31; Mismatches 53; Indels 73; Gaps 11; 

Qy . 37 QNSAEWRCLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESL 96 

: I : : : I : : I : I I I I I I I I I : I I I 
Db 581 KNPSKYIRYIYNRL LLENSTIRIDGMY ALFYIA— L 614 

Qy 97 KCIANGVTSKVFLAI RRCSTFQRMIAEVQEECYSKLN — VCSIAKRNPEAITEWQLPN- 153 

III I I I : I ::|: :| : I : :: | | |: | 

Db 615 KCAEN SKDILVLLNC LLADNDDEVRDRTNFFYYMLKEKIKELDTSNKQISNE 666 

Qy 154 HFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHI 196 

I : I I I I : I I I I : : I I : I : 

Db 667 YEEKLQNNENINEHNNIHYSN NNLIDQLLEYDQNT— NIDQLLYF 709 

Qy 197 LQTDHCAQTHPRADFNRRRTNE 218 

: :: I : I : : I : : I 
Db 710 I-SNH-IEKDPKEEFSYQHVKE 729 



RESULT 10 
Q9GV74 

ID Q9GV74 PRELIMINARY; PRT; 454 AA. 

AC Q9GV7.4; 

DT 01-MAR-2001 (TrEMBLrel. 16, Created) 

DT 01-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Differentiation associate-1. 

GN DIA-1. 



OS Dictyostelium discoideum (Slime mold) . 

OC Eukaryota; Mycetozoa; Dictyosteliida ; Dictyostelium. 

OX NCBI_TaxID-44 689 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20347020; PubMed-10887 082 ; 

RA Hirose S., Inazu Y., Chae S., Maeda Y. ; 

RT "Suppression of the growth/dif f erentiation transition in Dictyostelium 

RT development by transient expression of a novel gene, dial."; 

RL Development 127:3263-3270(2000). 

DR EMBL; AB007026; BAB13513.1; -. 

SQ SEQUENCE 454 AA; 48601 MW; 529D28B2D1908B85 CRC64; 

Query Match 7.1%; Score 90.5; DB 5; Length 454; 

Best Local Similarity 21.1%; Pred. No. 4.3; 

Matches 46; Conservative 42; Mismatches 93; Indels 37; Gaps 11; 

Qy 45 CLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKFDTQGK AFVKESLKCIA 100 

I II I I : : : I I : I I I : I I I : I : : : I I : : I : : I 

Db 22 7 CLNGANGYSCKSY—VENQSCDPSDEYPVCNSD-YQSCKCNSKGKGSCQSYYKLTQEC— 281 

Qy 101 N GVT S KVFLAI RRC S T FQ RMI AE VQEECYSKLNVCSIAKRNPEAITEWQLPNHF 155 

: : I : I I : : I I I hi I : : : : I : : I 

Db 282 KDSSNKLVL CAKSKNSIPSYKDYVTQINCQSQL — CNYSR DCIDPKAKVSTCF 332 

Qy 156 SN RYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHILQTDHCAQTHPRA 209 

: : III : I : I | | : : : : | : : : : : 

Db 333 NDLFLMCPRYYQEPEIGSSSSSSSSSSSSGSSSNSNIFINIINIINIIRSSE-SSNGSNS 391 

Qy 210 DFNRRRTNEPQKLKVLLRNLRGEEDSPSHIKRTSHESA 247 

: : : I I III Mil: 

Db 392 NSVSSESSSPSSSSV ESSSNSKSNHTSSESS 422 



RESULT 11 
014130 

ID 014130 PRELIMINARY; PRT; 783 AA. 

AC 014130; 

DT 01-JAN-1999 (TrEMBLrel. 09, Created) 

DT 01- JAN-1999 (TrEMBLrel. 09, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Putative regulatory protein, zinc finger. 

GN SPAC3C7 . 04 . 

OS , t Schizosaccharomyces pombe (Fission yeast) . 

OC Eukaryota; Fungi; Ascomycota; Schizosaccharomycetes ; 

OC Schizosaccharomycetales ; Schizosaccharoraycetaceae; 

OC Schizosaccharomyces. 

OX NCBI_TaxID=4896; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=972h-; 

RA Brown D., Churcher CM.; 

RL Submitted (SEP-1997) to the EMBL/GenBank/ DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=972h~; 

RA Wood V. , Barrell B.G., Rajandream M.A. ; 



RL 
CC 
CC 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 
SQ 



Submitted (SEP-1997) to the EMBL/ GenBank/DDBJ databases. 
-!- SUBCELLULAR LOCATION: NUCLEAR (BY SIMILARITY). 

-!- SIMILARITY: CONTAINS 1 ZN (2) -CYS ( 6) , FUNGAL-TYPE BINUCLEAR CLUSTER 
DOMAIN. 

EMBL; Z99568; CAB16735.1; -. 
PIR; , T38690; T38690. 
HSSP; P12351; 1HWT. 
GeneDB_SPombe; SPAC3C7.04; -. 
GO; GO: 0005634; C:nucleus; IEA. 

GO; GO: 0003700; F: transcription factor activity; IEA. 
GO; GO: 0008270; F:zinc ion binding; IEA. 

GO; GO: 0006355; P: regulation of transcription, DNA-dependent ; IEA. 

GO; GO: 0006350; P : transcription; IEA. 

InterPro; IPR007219; Fungal_trans . 

InterPro; IPR001138; Fungi_TrN. 

Pfam; PF04082; Fungal_trans ; 1. 

Pfam; PF00172; Zn_clus; 1. 

SMART; SM00066; GAL 4 ; 1. 

PROSITE; PS00463; ZN2_CY6_FUNGAL__1 ; 1. 

PROSITE; PS5004 8; ZN2_CY6_FUNGAL_2 ; 1. 

DNA-binding; Metal-binding; Nuclear protein; Transcription; 
Transcription regulation; Zinc. 

SEQUENCE 783 AA; 87957 MW; 4DD04F3865C44 8D2 CRC64; 



Query Match 7.1%; Score 90.5; DB 3; 

Best Local Similarity 23.3%; Pred. No. 8.2; 
Matches 56; Conservative 34; Mismatches 87; 



Length 783; 
Indels 63; 



Gaps 



Qy 

Db 

QY 
Db 

Qy 

Db 

Qy 

Db 



12; 
57 



10 VLVI SASATHEAEQNDSVSPRKSRV AAQN SAEWRC LN S ALQVGC GAF 

: : : : I : I III I : I I : : : I I : I : I : I 

535 IIIMSRPVLLHKMfvNAKNSPRVDRINEDCILAARHLISLVHLLQNHSQLSCYSFFDYNYT 594 



58 



99 



ACLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCI 

I: 1:1 II : Ml : :| I : I , 

595 FSSALWLLHCV-TEPCEED DIAMQYAYSALDYMAEGNEAAKNCARVIRLFDAHLKG 650 

100 AN GVT S KV- FLAI RRCST FQRMI AEVQ EECYSKLNVCSIAKRNPEAITEW 149 

MM: | : | : I I I I I I I : I I II : : I 

651 ARSDGNGNTSQSGFMA WQRWIAEVSAKDEPEKLMSPYNKSIGGGRNSNSLTPNA 704 

150 QLPNHFSNRYYNR LVRSLLECDEDTVSTIRDSLMEKIGPNMA SLFHILQTD 200 

I I : : | | | : | : : : | : | | : : | | | : | | 

7 05 NLGADVS — FFPTDDTSFLLDHSKLDDDLEKFASTLDPI — KTTPDLANDSSLLNWANTD 7 60 



RESULT 
Q8H2Y0 



12 



ID 
AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OC 



Q8H2Y0 PRELIMINARY; PRT; 622 AA. 

Q8H2Y0; 

01-MAR-2003 (TrEMBLrel . 
01-MAR-2003 (TrEMBLrel. 
01-MAR-2003 (TrEMBLrel . 
Putative transcription 
OJ1113_E01.11. 

Oryza sativa (japonica cultivar-group) . 

Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 
Spermatophyta; Magnoliophyta ; Liliopsida; Poales; Poaceae; 



23, Created) 

23, Last sequence update) 
23, Last annotation update) 
repressor HOTR. 



OC Ehrhartoideae; Oryzeae; Oryza. 

OX NCBI_TaxID=39947; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Nipponbare; 

RA Sasaki T. , Matsumoto T . , Katayose. Y. ; 

RT "Oryza satlva nipponbare (GA3 ) genomic DNA, chromosome 7, BAC 

RT clone :OJ1113_E0i. "; 

RL Submitted (JUN-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AP005437; BAC20898.1; -. 

SQ SEQUENCE 622 AA; 68669 MW; 88302DE4 15DECDC9 CRC64 ; 



Query Match 7.1%; Score 89.5; DB 10; Length 622; 

Best Local Similarity 23.8%; Pred. No. 7.8; 

Matches 41; Conservative 36; Mismatches 72; Indels 23; Gaps 8; 

Qy 58 ACLENSTCDTDGMYDICKSFLYSAAKFDTQG— KAFVKESLKC-IANGVTSKVFLAIRRC 114 

I : I : : I : I I I : III : I I I : I I : : I : : 
Db 415 API EDGKQET S EVI DICE AKKS DNS ACTNKVI S GS KKCQLHNGCKAEEFVS 465 

Qy 115 STFQRMIAEVQEECYSKLNVCSIAKRNPEAIT — EWQLPNHFSNRYYNRLVRSLLECDE 172 

I : I : I I I : : : I : I I : : I I I : I : : I : 

Db 466 SRVIDLLQNE — EKVKSMTVDKLSGEEISHGKYQSQENQPSGRMWFELIK — LQNPT 518 

Qy 173 DTVSTIRDSLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKV 224 

|:|: ::: |:|:: I : I :| I I I I : : :|| 

Db 519 STLS SKGQGRQKRVTGNVAAI CEALTDNRCRET I PMA — GRERCDAHEGIKV 568 



RESULT 13 
064605 



ID -O64605 PRELIMINARY; PRT; 1126 AA. 

AC 064605; 

DT 01-AUG-1998 (TrEMBLrel. 07, Created) 

DT 01-AUG-1998 (TrEMBLrel. 07, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE F1707.15 protein. 

GN F1707.15. 

OS Arabidopsis thaliana (Mouse-ear cress) . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta; 

OC Spermatophyta; Magnoliophyta ; eudicotyledons ; core eudicots; rosids; 

OC eurosids II; Brassicales; Brassicaceae; Arabidopsis. 

OX NCBI_TaxID=3702; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN^cv. Columbia; 

RA Vysotskaia V.S., Schwartz J.R., Toriumi M. , Yu G., Kwan A., Oji O., 

RA Liu S., Li J., Araujo R., Au M. , Brendel V., Buehler E., Conway A.B., 

RA Conway A.R., Dewar K., Feng J., Kim C, Kurtz D., Li Y. , Palm C.J., 

RA Shinn P., Sun H., Davis R.W. , Ecker J.R., Federspiel N.A. , 

RA Theologis A. ; 

RT "Arabidopsis thaliana chromosome 1 BAC F1707 sequence."; 

RL Submitted (DEC-1997) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Columbia; 

RA Theologis A. ; 



RL Submitted (JUN-1998) to the EMBL/ GenBank/DDBJ databases. 

. DR EMBL; AC003671; AAC18813.1; -. 

DR PIR; T01491; T01491. 

DR GO; GO: 0005622; C: intracellular ; IEA. 

DR GO; GO: 0004842; F: ubiquitin-protein ligase activity; IEA. 

DR GO; GO: 0006512; P:ubiquitin cycle; IEA. 

DR InterPro; IPR000569; HECT_domain. 

DR Pfam; PF00632; HECT; 1. 

DR SMART; SM00119; HECTc; 1. 

DR PROSITE; PS50237; HECT; 1. 

SQ SEQUENCE 1126 AA; 126148 MW; 0 DAI A3 AC F 6A8 F 6 4 E CRC64; 

Query Match 7.1%; Score 89.5; DB 10; Length 1126; 

Best Local Similarity 22.1%; Pred. No. 16; 

Matches 46; Conservative 39; Mismatches 76; Indels 47; Gaps 8; 

SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCD 66 

I : : I I 11:11 III: III: : : I : 

SSETQKDAESELSVARRK NCAEL YNIFLQLP QSDLCNLCMLLGY 455 

TDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQRMIAE 123 

: I : I : I II : I : I : I : I : : I : I I I s I 



I I I : I :l 1:1: : - : I 2 . I I : : — • 

-SQNTCSMA GASILRVLQVLSSLTSTIDDSNVGTDKETDQEEQNIMQGLKV 562 



Qy 


14 


Db 


412 


Qy 


67 


Db 


456 


Qy 


124 


Db 


513 


Qy 


180 


Db 


563 



-DSLMEKIGPNMASLFHILQTDHCAQT 205 
: I ::: I :: I I I I I I 



RESULT 14 
Q9M7K6 

ID Q9M7K6 PRELIMINARY; PRT; 3658 AA. 

AC Q9M7K6; 

DT 01-OCT-2000 {TrEMBLrel. 15, Created) 

DT 01-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Ubiquitin-protein ligase 2. 

GN UPL2 . 

OS Arabidopsis thaliana (Mouse-ear cress) . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta; eudicotyledons ; core eudicots; rosids; 

OC eurosids II; Brassicales; Brassicaceae; Arabidopsis. 

OX NCBI_TaxID=37 02; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Columbia; 

RX MEDLINE=20040063; PubMed=10571878 ; 

RA Bates P.W., Vierstra R.D.; 

RT "UPL1 and 2, two 405 kDa ubiquitin-protein ligases from Arabidopsis 

RT thaliana related to the HECT-domain protein family."; 

RL Plant J. 20:183-195(1999). 

DR EMBL; AF127565; AAF36455.1; -. 

DR GO; GO: 0005622; C : intracellular ; IEA. 



DR GO; GO: 0016874; F:ligase activity; IEA. 

DR GO; GO: 0004842; F: ubiqui tin-protein ligase activity; IEA. 

DR GO; GO: 0006512; P:ubiquitin cycle; IEA. 

DR InterPro; IPR008938; ARM. 

DR InterPro; IPR000569; HECT_domain . 

DR InterPro; IPR000449; UBA_domain. 

DR InterPro; IPR003903; UIM. 

DR Pfam; PF00632; HECT; 1. 

DR Pfam; PF00627; UBA; 1. 

DR Pfam; PF02809; UIM; 1. 

DR SMART; SM00119; HECTc; 1. 

DR SMART; SM00165; UBA; 1. 

DR SMART; SM00726; UIM; 1. 

DR PROSITE; PS50237; HECT; 1. 

DR PROSITE; PS50030; UBA; 1. 

DR PROSITE; PS50330; UIM; 1. 

KW Ligase. 

SQ SEQUENCE 3658 AA; 403582 MW; 1FB4F8EB8C1F73D1 CRC64; 

Query Match 7.1%; Score 89.5; DB 10; Length 3658; 

Best Local Similarity 22.1%; Pred. No. 65; 

Matches 46; Conservative 39; Mismatches 76; Indels 47; Gaps 8; 

Q y 14 SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCD 66 

| : : M I I : I I III: III: : : I : 

Db 2977 SSETQKDAESELSVARRK NCAEL YNIFLQLP QSDLCNLCMLLGY 3020 

Qy ' 67 ---TDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRMIAE 123 

: | : | : | I I : I : I : I : I : : I : I I I : I 

Db 3021 EGLSDKIYSLAGEVLKKLAAVDVTHRKFFTKELSELASGLSSSTVRVLATLSTTQKM 3077 

Qy 124 VQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIR—-— 179 

I I I : I : I I : I : : : : : I : II- : : : 

Db 307 8 SQNTCSMA GASILRVLQVLSSLTSTIDDSNVGTDKETDQEEQNIMQGLKV 3127 

Qy 180 — DSLMEKIGPNMASLFHILQTDHCAQT 205 

: I : : : I : : I I II I I 

Db 3128 ALEPLWQELGQCI S - -MT ELQLDHTAAT 3153 



RESULT 15 
Q8CP76 

ID Q8CP7 6 PRELIMINARY; PRT; 943 9 AA. 

AC Q8CP7 6; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE EbhA protein. 

GN SE1128. 

OS Staphylococcus epidermidis. 

OC Bacteria; Firmicutes; Bacillales; Staphylococcus. 

OX NCBI_TaxID=1282; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN-ATCC 12228; 

RA Zhang Y. , Ren S., Li H., Fu G. , Lu L., Lu G. , Jia J. , Tu Y., Qin Z., 

RA Chen Z., Wen Y. ; 



RL Submitted (NOV-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AE016747; AAO04725.1; 

DR InterPro; IPR008454; CnaJB. 

DR InterPro; IPR002988; GA. 

DR Pfam; PF05738; Cna_B; 1. 

DR Pfam; PF01468; GA; 39. 

KW Complete proteome. 

SQ SEQUENCE 9439 AA; 1050771 MW; 5A8C68E9AA31CDD4 CRC64; 

Query Match 7.1%; Score 89.5; DB 16; Length 9439; 

Best Local Similarity 22.0%; Pred. No. 2e+02; 

Matches 50; Conservative 38; Mismatches 78; Indels 61; Gaps 11; 

Qy 17 ATHEAEQ NDSVSPRKSRVA AQNSAEWRCLNSALQ 51 

I : I I I : I I : I : I : I : I : I : I I 
Db 2190 AKNEAERI LGND — NPQVSQVTQALNKIKAIQPKLTEAINMLQNKENNTELVNAKNR 2244 

Qy 52 VGCGAFACLENSTCDTDGMYDICKSFL — YSAAKFDTQGKAFVKESLKCIANG 102 

|||:||| : : : : I : I I : I : : : : : III 
Db 2245 LENAVNDTDPTHGMTQETINNYNAKKREAQNE — IQKANMIINNGDATAQDI 2294 

Qy 103 VTSKVFLAI RRCSTFQRMIAEVQEECYSKL — NVCSIAKRNPEAITEWQLPNHF 155 

: I I : : I : I : I I I I : I : I : I : 

Db ■ 2295 S S EKS KVEQVLQALQNAKNDLRADKRELQTAYNKLIQNVNTNGKK- PS S IQN YKSARRNI 2353 

Qy 156 SNRY — YNRLVRSLLECDEDTVSTIRDSL — MEKIGPNMASLFHILQ 198 

|: I ::|| | | : : I : I : I I : :|ll 

Db 2354 ENQYNTAKNEAHNVLENTNPTVNAVEDALRKINAIQPEVTKAINILQ 2400 



Search completed: July 19, 2004, 15:33:30 
Job time : 42 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: July 19, 2004, 15:29:18 ; Search time 13 Seconds 

(without alignments) 
989.333 Million cell updates/sec 

Title: US-10-614-990-2 
Perfect score: 1268 

Sequence: 1 MLQNSAVLLVLVT S AS AT HE NLRGEEDSPSHIKRTSHESA 247 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 141681 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SwissProt_42 : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1268 


100. 


0 


247 


1 


STC1 HUMAN 


P52823 


homo sapien 


2 


1234 


97. 


3 


247 


1 


STC1_M0USE 


055183 


mus musculu 


3 


1231 


97. 


1 


247 


1 


STC1_RAT 


P97574 


rattus norv 


4 


682 


53. 


8 


250 


1 


STC ANGAU 


P18301 


anguilla au 


5 


662.5 


52. 


2 


256 


1 


STC ONCKI 


Q08264 


oncorhynchu 


6 


662.5 


52. 


2 


256 


1 


■ STC_ONCMY 


P43648 


oncorhynchu 


7 


617 


48. 


7 


179 


1 


STC ONCKE 


P43647 


oncorhynchu 


8 


354.5 


28. 


0 


296 


1 


STC2_MOUSE 


088452 


mus musculu 


9 


354 


27. 


9 


302 


1 


STC2 HUMAN 


076061 


homo sapien 


10 


352 


27. 


8 


302 


1 


STC2 MACNE 


097561 


macaca neme 


11 


349 


27. 


5 


296 


1 


STC2 RAT 


Q9r0k8 


rattus norv 


12 


181.5 


14. 


3 


197 


1 


STC2 CAVPO 


P57675 


cavia porce 


13 


156 


12. 


3 


40 


1 


STC ONCNE 


P43649 


oncorhynchu 


14 


94 


7. 


4 


1105 


1 


A3B1 MOUSE 


Q9zltl 


mus musculu 


15 


90.5 


7. 


1 


289 


1 


SNAA_VITVI 


P93798 


vitis vinif 


16 


87.5 


6. 


9 


1465 


1 


DPOA MOUSE 


P33609 


mus musculu 


17 


86.5 


6. 


8 


509 


1 


VP67 NPVCF 


P41717 


choristoneu 



18 


86 


6.8 


581 


1 


YMM7__YEAST 


Q03124 


saccharomyc 


19 


86 


6.8 


695 


1 


TRFL_HORSE 


077811 


equus cabal 


20 


84.5 


6.7 


1451 


1 


DPOA_RAT 


089042 


rattus norv 


21 


83 


6.5 


1094 


1 


A3 B INHUMAN 


000203 


homo sapien 


22 


82.5 


6.5 


1483 


1 


UFD4_YEAST 


P33202 


saccharomyc 


23 


82.5 


6.5 


2335 


1 


TORl_SCHPO 


014356 


schizosacch 


24 


81.5 


6.4 


708 


1 


TRFL_BOVIN 


P24627 


bos taurus 


25 


81.5 


6.4 


804 


1 


S15AJHUMAN 


Q8tag9 


homo sapien 


26 


81 


6.4 


4377 


1 


ANK3_HUMAN 


Q12955 


homo sapien 


27 


80 


6.3 


509 


1 


VP64_NPVOP 


P13625 


orgyia pseu 


28 


80 


6.3 


851 


1 


STR8_MOUSE 


Q8k031 


mus musculu 


29 


80 


6.3 


998 


1 


ECA3 ARATH 


Q9sy55 


arabidopsis 


30 


79.5 


6.3 


609 


1 


MC70_YEAST 


Q12411 


saccharomyc 


31 


79.5 


6.3 


708 


1 


TRFL_BUBBU 


077698 


bubalus bub 


32 


79 


6.2 


440 


1 


VI 17 FOWPV 


Q9j5a5 


fowlpox vir 


33 


78 . 5 


6.2 


132 


1 


IL4 HORSE 


P42202 


equus cabal 


34 


78 . 5 


6.2 


708 


1 


TRFL_CAPHI 


Q29477 


capra hircu 


35 


78 . 5 


6.2 


1127 


1 


MDM1_YEAST 


Q01846 


saccharomyc 


36 


78 


6.2 


458 


1 


IF3TJTORCA 


P23729 


torpedo cal 


37 


78 


6.2 


688 


1 


DNAK ANAVA 


005714 


anabaena va 


38 


78 


6.2 


803 


1 


GYRB_BUCAI 


P57126 


buchnera ap 


39 


78 


6.2 


1816 


1 


KF1B MOUSE 


Q60575 


mus musculu 


40 


77.5 


6.1 


640 


1 


CLAT MOUSE 


Q03059. 


mus musculu 


41 


77 


6.1 


249 


1 


TPIS TREPA 


083548 


treponema p 


42 


77 


6.1 


1023 


1 


STR8_HUMAN 


Q92502 


homo sapien 


43 


76.5 


6.0 


229 


1 


Z313 MOUSE 


Q9et26 


mus musculu 


44 


76.5 


6.0 


455 


1 


TBA1 SCHPO 


P04688 


schizosacch 


45 


76 


6.0 


324 


1 


NAA2 RHILO 


Q982f2 


rhizobium 1 



ALIGNMENTS 



RESULT 1 
STC1__HUMAN 

ID STC1_HUMAN STANDARD; PRT; 247. AA. 

AC P52823; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Stanniocalcin 1 precursor (STC-1) . 

GN STC1 OR STC. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Fibrosarcoma, and Lung carcinoma; 

RX MEDLINE=96077825; PubMed=74 8 982 8 ; 

RA Chang A.C.-M. , Janosi J., Hulsbeek M. , de Jong D., Jeffrey K.J., 

RA Noble J.R., Reddel R.R.; 

RT "A novel human cDNA highly homologous to the fish hormone 

RT stanniocalcin."; 

RL Mol. Cell. Endocrinol. 112:241-247(1995). 
RN [2] 

RP SEQUENCE FROM N.A. 



RC TISSUE=Fetal lung; 

RX MEDLINE=96312491; PubMed=8700837 ; 

RA Olsen H.S., Cepeda M.A. , Zhang Q.-Q., Rosen C.A., Vozzolo B.L., 

RA Wagner G.F.; 

RT "Human stanniocalcin : a possible hormonal regulator of mineral 

RT metabolism."; 

RL Proc. Natl. Acad. Sci. U.S.A. 93:1792-1796(1996). 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Jeffrey K.J., Reddel R.R.; 

RT "Characterization of the human stanniocalcin 1 gene."; 

RL Submitted (MAR-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Colon, Kidney, and Stomach; 

RX MEDLINE=22388257; PubMed-12477932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K. , Farmer A. A., Rubin G.M. , Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X. , Gibbs R.A., 

RA Fahey J., Helton E. , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C. f 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M. f Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: Stimulates renal phosphate reabsorption, and could 

CC therefore prevent hypercalcemia. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

cc -i- TISSUE SPECIFICITY: Expressed in most tissues, with the highest 

CC levels in ovary, prostate, heart, kidney and thyroid. In the 

CC kidney, expression is confined to the nephron, specifically in the 

CC distal convoluted tubule and in the collecting tubule. Not 

CC detected in the brain, liver, spleen, peripheral blood leukocytes 

CC and adrenal medulla. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC 7"" 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinf ormatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib . ch) . 



CC 

DR EMBL; U25997; AAC09472.1; 



DR EMBL; U46768; AAA88903.1; -. 

DR EMBL ; AF242179; AAL79522.1; -. 

DR EMBL; BC029044; AAH29044.1; 

DR Genew; HGNC: 11373; STCl. 

DR MIM; 601185; -. 

DR GO; GO: 0005181; F: glycopeptide hormone; TAS . 

DR GO; GO: 0005180; F: peptide hormone; TAS. 

DR GO; GO: 0006874; P: calcium ion homeostasis; TAS. 

DR GO; GO: 0007166; P:cell surface receptor linked signal transdu. . .; TAS. 



DR 


GO; GO: 0007267; 


P: 


cell-cell signaling; TAS. 


DR 


GO; GO:0007584; 


P: 


response to 


nutrients; TAS. 


DR 


InterPro; 


IPR004 


97 


8; Stanniocalcin. 


DR 


Pfam; PF03298; Stanniocalcin; 


1. 


KW 


Hormone; 


Signal ; 


Glycoprotein. x 


FT 


SIGNAL 


1 




17 


POTENTIAL. 


FT 


PROPEP 


18 




33 


POTENTIAL. 


FT 


CHAIN 


34 




247 


STANNIOCALCIN 1. 


FT 


DISULFID 


45 




. 59 


BY SIMILARITY. 


FT 


DISULFID 


54 




74 


BY SIMILARITY. 


FT 


DISULFID 


65 




114 


BY SIMILARITY. 


FT 


DISULFID 


98 




128 


BY SIMILARITY. 


FT 


DISULFID 


135 




170 


BY SIMILARITY. 


FT 


DISULFID 


202 




202 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


62 




62 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


247 AA; 


27621 MW; 


1E4A8BD861B49AED CRC64; 



Query Match 100.0%; Score 1268; DB 1; Length 247; 

Best Local Similarity 100.0%; Pred. No. 3.4e-102; 

Matches 247; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

| I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I 
Db 1 MLQN SAVLLVLVI S AS ATHEAEQNDS VS PRKS RVAAQN S AEWRCLN S ALQVGCGAFACL 60 

Q y 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

| | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

| | | | | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
D b 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

Qy 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

| I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I 

D b 181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 240 

Qy 241 RTSHESA 247 

I I I I I II 

Db 241 RTSHESA 247 



RESULT 2 
STCl_MOUSE 

ID STCl_MOUSE STANDARD; PRT; 247 AA. 

AC 055183; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 



DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Stanniocalcin 1 precursor (STC-1) . 

GN STC1 OR STC. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC STRAIN=BALB/c; 

RX MEDLINE=97179050; PubMed-9027337 ; 

RA Chang A.C.-M., Dunham M.A., Jeffrey K.J., Reddel R.R.; 

RT "Molecular cloning and characterization of mouse stanniocalcin cDNA. " ; 

RL Mol. Cell. Endocrinol. 124:185-187(1996). 

RN [2] 

RP SEQUENCE FROM N . A. 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K., Farmer A. A., Rubin G.M., Hong L. , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T..E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A.", Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D .M. , Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I./ Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones , S . J .M. , Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: Stimulates renal phosphate reabsorption, and could 

CC therefore prevent hypercalcemia (By similarity) . 

CC -!- SUBUNIT: Homodimer; disulf ide-linked (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Expressed in many tissues. 
CC SIMILARITY: Belongs to the stanniocalcin family. 

CC ■ 7 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 
CC 

DR EMBL; U47815; AAC00050.1; -. 

DR EMBL; BC021425; AAH21425.1; -. 

DR MGD; MGI: 109131; Stcl . 

DR GO; GO: 0005615; C : extracellular space; IDA. 
DR InterPro; IPR004978; Stanniocalcin. 



DR 


Pfam; PF03298; Stanniocalcin; 


1. 


KW 


Hormone ; 


Signal ; 


Glycoprotein 




FT 


SIGNAL 


1 


17 


POTENTIAL. 


FT 


PROPEP 


18 


33 


POTENTIAL. 


FT 


CHAIN 


34 


247 


STANNIOCALCIN 1. 


FT 


DISULFID 


45 


59 


BY SIMILARITY. 


FT 


DISULFID 


54 


74 


BY SIMILARITY. 


FT 


DISULFID 


65 


114 


BY SIMILARITY. 


FT 


DISULFID 


98 


128 


BY SIMILARITY. 


FT 


DISULFID 


135 


170 


BY SIMILARITY. 


FT 


DISULFID 


202 


202 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


62 


62 


N-LINKED (GLCNAC. . .) (POT 


SQ 


SEQUENCE 


247 AA; 27480 MW; 


DAD30D08575A513B CRC64; 


Query Match 




97.3%; 


Score 1234; DB 1; Length 



Best Local Similarity 96.4%; 
Matches 238; Conservative 



Pred. No. 2.9e-99; 
5; Mismatches 4; 



Indels 



0; Gaps 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1 MLQN S AVLLVLVI SASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 

|||||||:| I I I I I : I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

1 MLQN S AVI LALVI SAAAAHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 

61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGITSKVFLAI RRCSTFQRM 

121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 

I I I I I I : II I II I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

121 IAEVQEDCYSKLNVCSIAKRNPEAITEVIQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 



181 



0; 



60 



60 



120 



120 



180 



180 



240 



SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 

I | | | I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M 

181 SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEGDSPSHIK 240 



Qy 

Db 



241 RTSHESA 247 

III III 
241 RTSQESA 247 



RESULT 3 
STC1_RAT 

ID STC1_RAT STANDARD; PRT; 247 AA. 

AC P97574; 

DT 15-JUL-1998 (Rel. 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Stanniocalcin 1 precursor (STC-1) . 

GN STC1 OR STC. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 
OX NCBI_TaxID=10116; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Kidney; 

RA Abe T., Tanemoto M. , Hall A.E., Brown E.M., Hebert S.C.; 
RL Submitted (JAN-1997) to the EMBL/ GenBank/DDBJ databases. 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



FUNCTION: Stimulates renal phosphate reabsorption, and could 
therefore prevent hypercalcemia (By similarity) . 
SUBUNIT: Homodimer; disulf ide-linked (By. similarity). 
SUBCELLULAR LOCATION: Secreted. 

SIMILARITY: Belongs to the stanniocalcin family. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; U62667; AAB39541.1; 
InterPro; I PRO 04 97 8; Stanniocalcin. 
Pfam; PF03298; Stanniocalcin; 1. 



Hormone; 


Signal; 


Glycoprot 


ein . 


SIGNAL 


1 


17 


POTENTIAL. 


PROPEP 


18 


33 


POTENTIAL. 


CHAIN 


34 


247 


STANNIOCALCIN 1. 


DISULFID 


45 


59 


BY SIMILARITY. 


DISULFID 


54 


74 


BY SIMILARITY. 


DISULFID 


65 


114 


BY SIMILARITY. 


DISULFID 


98 


128 


BY SIMILARITY. 


DISULFID 


135 


170 


BY SIMILARITY. 


DISULFID 


202 


202 


INTERCHAIN (BY SIMILARITY) . 


CARBOHYD 


62 


62 


N-LINKED (GLCNAC. . .) (POT 


SEQUENCE 


247 AA; 27507 


MW; DAC2FD08575A513B CRC64; 



Query Match 97.1%; Score 1231; DB 1; Length 247; 

Best Local Similarity 96.0%; Pred. No. 5.2e-99; 

Matches 237; Conservative 6; Mismatches 4; Indels 0; Gaps 0; 



Qy 


1 


MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 


60 






1 1 1 1 1 1 1 : 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


MLQN S AVI LALVI S AAAAHEAEQND S VS P RK S RVAAQN S AEWRCLN S ALQVGCGAFACL 


60 


Qy 


61 


ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 


120 






I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGITSKVFLAIRRCSTFQRM 


120 


Qy 


121 


IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 


180 






1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


121 


IAEVQEDCYSKLNVCSIAKRNPEAITEVIQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 


180 


Qy 


181 


SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEEDSPSHIK 


240 






1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


181 


SLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRRTNEPQKLKVLLRNLRGEGDSPSHIK 


240 


Qy 


241 


RTSHESA 247 




Db 


241 


III 1:1 

RTSQENA 247 





RESULT 4 
STC ANGAU 



ID STC_ANGAU STANDARD; PRT; 250 AA. 

AC P18301; 

DT 01-NOV-1990 (Rel. 16, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Stanniocalcin precursor (STC) (Corpuscles of stannius protein) (CS) 

DE (Hypocalcin) (Teleocalcin) . 

GN STC. 

OS Anguilla australis (Australian eel) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Anguillif ormes ; Anguillidae; 

OC Anguilla. 

OX NCBI_TaxID=7940; 

RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 33-57. 

RX MEDLINE=88083961; PubMed-33197 39 ; 

RA Butkus A., Roche P.J., Fernley R.T., Haralambidis J., 

RA Penschow J.D., Ryan G.B., Trahair J.F., Tregear G.W., Coghlan J. P.; 

RT "Purification and cloning of a corpuscles of Stannius protein from 

RT Anguilla australis."; 

RL Mol. Cell. Endocrinol. 54:123-133(1987). 

RN [2] 

RP REVISIONS . 

RA Roche P. J. ; 

RL Submitted (DEC-1997) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: Its primary function is the prevention of hypercalcemia. 

CC Upon release into the circulation, it lowers calcium transport by 

CC the gills, thereby reducing its rate of influx from the 

CC environment into the extracellular compartment. STC also 

CC stimulates phosphate reabsorption by renal proximal tubules. The 

CC consequence of this action is increased levels of plasma 

CC phosphate, which combines with excess calcium and promotes its 

CC disposal into bone and scales. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- TISSUE SPECIFICITY: Corpuscles of Stannius. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC 

CC This SWTSS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC . 

DR EMBL; M36967; AAB91483.1; -. 

DR PIR; A54648; A54648. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Hormone; Signal; Glycoprotein. 

FT SIGNAL 1 17 POTENTIAL. 

FT PROPEP 18 32 POTENTIAL. 

FT CHAIN 33 250 STANNIOCALCIN . 

FT DISULFID 44 58 BY SIMILARITY. 

FT DISULFID 53 73 BY SIMILARITY. 

FT DISULFID 64 113 BY SIMILARITY. 

FT DISULFID 97 127 BY SIMILARITY. 



FT DISULFID 134 169 BY SIMILARITY. 

FT DISULFID 201 201 INTERCHAIN (BY SIMILARITY). 

FT CARBOHYD 61 61 N-LINKED (GLCNAC. . .) (PROBABLE). 

SQ SEQUENCE 250 AA; 27174 MW; BB972BD951F75B3E CRC64 ; 

Query Match 53.8%; Score 682; DB 1; Length 250; 

Best Local Similarity 54.3%; Pred. No. l.le-51; 

Matches 138; Conservative 47; Mismatches 57; Indels 12; Gaps 6; 

Qy 1 MLQNSAVLLVLVISASATHEAEQNDSVSPRKSRVAAQNSJ\EWRCLNSALQVGCGAFACL 60 

| | : I : : I I i : : I : I : : : : : I I I : I : I : : : I I I I I I I I I I I I I I I I 

Db 1 MLRMSGLILTLVL-VTAAYEQDESEPLSPRTARFSASSPSDVARCLNGALQVGCSAFACL 59 

Qy 61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCI7^NGVTSKVFLAIRRCSTFQRM 120 

: I I I I : I I I I :: I I : I I I : I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I h I I : I 

Db 60 DNSTCNTDGMHEICRSFLHGAAKFDTQGKTFVKESLKCIANGITSKVFLTIRRCSSFQKM 119 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

I : | M I I I I I I I : : I I : I : MM: II I : I : I MM: I : : M I II M I I : I 

Db 120 ISEVQEECYSKLDLCSVAQSNPEAMGEVAQVPSQFPNRYYSTLLQSLLTCDEDTVEQVRA 179 

Qy 181 SLMEKIGPNMASLFHILQTDHC AQTHP-RADFNRRRTNEPQKLKVLLRNLRGEED 234 

I : : : I I M : I I I I I I I : I I I : M I : 

Db 18 0 GLVSRLEPEMGVLFQLLQTKACPPSAAGGTGPVGAGGSWRWPMGPPMFKI-QPNLRSRD- 237 

Qy 235 SPSHI— KRTSHES 246 

I : I : I : I I 
Db 238 -PTHLFAKKRSTSS 250 



RESULT 5 
STC_ONCKI 

ID STC_ONCKI STANDARD; PRT; 256 AA. 

AC Q08264; 

DT 01-FEB-1995 (Rel. 31, Created) 

DT 01-FEB-1995 (Rel. 31, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Stanniocalcin precursor (STC) (Corpuscles of stannius protein) (CS) 

DE (Hypocalcin) (Teleocalcin) . 

GN STC . 

OS Oncorhynchus kisutch (Coho salmon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii; Salmonif ormes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8019; 

RN [1] 

RP SEQUENCE FROM N. A. 

RX MEDLINE=93246046; PubMed=1363790; 

RA Wagner G.F., Dimattia G.E., Davie J.R., Copp D.H., Friesen H.G.; 

RT "Molecular cloning and cDNA sequence analysis of coho salmon 

RT stanniocalcin."; 

RL Mol. Cell. Endocrinol. 90:7-15(1992). 

RN [2] 

RP SEQUENCE OF 34-73. 

RX MEDLINE=89065334; PubMed=3197944 ; 

RA Wagner G.F., Fenwick J.C., Park CM., Milliken C, Copp D.H., 

RA Friesen H.G. ; 



RT "Comparative biochemistry and physiology of teleocalcin from sockeye 

RT and coho salmon."; 

RL Gen. Comp . Endocrinol. 72:237-246(1988). 

CC -!- FUNCTION: Its primary function is the prevention of hypercalcemia. 

CC Upon release into the circulation, it lowers calcium transport by 

CC the gills, thereby reducing its rate of influx from the 

CC environment into the extracellular compartment. STC also 

CC stimulates phosphate reabsorption by renal proximal tubules. The 

CC consequence of this action is increased levels of plasma 

CC phosphate, which combines with excess calcium and promotes its 

CC disposal into bone and scales. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

cc _j_ TISSUE SPECIFICITY: Produced and secreted by the corpuscles of 
CC stannius. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC ~" ; " 7~~ 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement {See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; S59519; AAB26419.1; 

DR PIR; B60841; B60841. 

DR PIR; 151197; 151197. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Hormone; Signal; Glycoprotein; Calcium transport. 
FT 
FT 
FT 
FT 
SQ 

Query Match 52.2%; Score 662.5; DB 1; Length 256; 

Best Local Similarity 53.4%; Pred. No. 5.3e-50; 

Matches 125; Conservative 46; Mismatches 54; Indels 9; Gaps 4; 

LVI SASATHEAEQNDSVSPRKSRVAAQNSAEVVRCLNSALQVGCGAFACLENSTCDTDGM 7 0 
||: : M : : : : | | I : : I : : : : : I I I i I I I II I I I I I I I I I I M I I I I 
L VLGTAAT FDT D P E EA- S P RRARF S S N S P S DVARC LN GALAVGC GT FAC LENSTCDTD GM 7 0 

YDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRMIAEVQEECYS 130 

: | | | : | : : I I I : I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I : I I I I I I I I 

HDICQLFFHTAATFNTQGKTFVKESLRCIANGVTSKVFQTIRRCGVFQRMISEVQEECYS 130 

KLN VC S I AKRN P E AI T E WQ L PNH F S N RY YN RLVRS L L EC D E DT VS T I RD S LME K I G PNM 190 
: | : : | : | : I I I II I I I I : I I I MM: h : II I I I M I M : I I : : : I I M 
RL D I CGVAR S N P EAI G EWQVP AH F PN RY Y S T L LQ S L LAC DE ET VAVVRAGLVARLG P DM 190 

ASLFHILQTDHCAQTHPRADFN RRRTNEPQKLKVLLRNLRGEEDSPSHI 239 

:|| :|| , II I : : I I M : MM 



SIGNAL 


1 . 


18 


POTENTIAL. 


PROPEP 


19 


33 


BY SIMILARITY. 


CHAIN 


34 


256 


STANNIOCALCIN. 


CARBOHYD 


62 


62 


N-LINKED (GLCNAC. . . ) . 


SEQUENCE 


256 AA; 


28074 


MW; E688B9.30B394A8D3 CRC64; 



Qy 


11 


Db 


12 


Qy 


71 


Db 


71 


Qy 


131 


Db 


131 


Qy 


191 


Db 


191 



RESULT 6 



STC_ONCMY 

ID STC_ONCMY STANDARD; PRT; 2 56 AA. 

AC P43648; Q98SE1; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Stanniocalcin precursor (STC) (Corpuscles of stannius protein) (CS) 

DE (Hypocalcin) (Teleocalcin) . 

GN STC. 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) . 

OC Eukaryota; Metaz.oa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii; Salmoni formes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8 022; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RA McCudden C.R., Wagner G.F.; 

RT "Rainbow trout ovarian stanniocalcin."; 

RL Submitted (DEC-2000) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 34-66. 

RC TISSUE=Stannius corpuscules; 

RX MEDLINE=8 81968 01; PubMed=3 3 6028 8; 

RA Lafeber F.P.J.G., Hanssen R.G.J.M., Choy Y.M., Flik G. , 

RA Herrmann-Erlee M.P.M., Pang P.K.T., Wendelaar Bonga S.E.; 

RT "Identification of hypocalcin (teleocalcin) isolated from trout 

RT Stannius corpuscles."; 

RL Gen. Comp. Endocrinol. 69:19-30(198 8). 

CC -!- FUNCTION: Its primary function is the prevention of hypercalcemia. 

CC Upon release into the circulation, it lowers calcium transport by 

CC the gills, thereby reducing its rate of influx from the 

CC environment into the extracellular compartment. STC also 

CC stimulates phosphate reabsorption by renal proximal tubules. The 

CC consequence of this action is increased levels of plasma 

CC phosphate, which combines with excess calcium and promotes its 

CC disposal into bone and scales. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Produced and secreted by the corpuscles of 
CC stannius. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC ■ ; 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC . — 

DR EMBL; AF326317; AAK01423.1; -. 

DR EMBL; AF326318; AAK01424.1; -. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 



KW 


Hormone ; 


Signal ; 


Glycoprotein 


; Calcium transport. 


FT 


SIGNAL 


1 


18 


POTENTIAL. 


FT 


PROPEP 


19 


33 




FT 


CHAIN 


34 


256 


STANNIOCALCIN. 


FT 


CARBOHYD 


62 


62 


N-LINKED (GLCNAC. . .). 


FT 


VARIANT 


41 


41 


D -> E. 


SQ 


SEQUENCE 


256 AA; 28104 MW; 


E688A0E0B394A8D3 CRC64; 



Query Match 52.2%; Score 662.5; DB 1; 

Best Local Similarity 53.4%; Pred. No. 5.3e-50; 
Matches 125; Conservative 46; Mismatches 54; 



Length 256; 
Indels 9; 



Gaps 



QY 
Db 

Qy 

Db 

Qy 

Db 



11 



12 



71 



LVI S AS ATH EAEQN D S VS P RKS RVAAQN S AEVVRCLN S ALQVGCGAFACLEN S T C DT DGM 

II: : I I : : :: I I I : : I :: : : : I I I I I II I I I I I I I I I I II I I I I I I 

LVLGTAATFDTDPEEA-SPRRARFSSNSPSDVARCLNGALAVGCGTFACLENSTCDTDGM 



YDI CKS FLYSAAKFDTQGKAFVKES LKCI ANGVTSKVFLAI RRCSTFQRMI AEVQEECYS 
: I I I : I : : I I I : I I I I 111111:11111111111 MM II II I : II M II M 
71 HDICQLFFHTAATFNTQGKTFVKESLRCIANGVTSKVFQTIRRCGVFQRMISEVQEECYS 

131 KLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNM 
: I : : I : I : I II II I I I I : I II MM: I : : M I M I : M : : I I : —MM 
131 RL D I C GVARS N P EAI GEVVQVP AH FPNRYYSTL LQ S LLAC DE ET VAVVRAGLVARL G P DM 



4; 



70 



70 



130 



130 



190 



190 



Qy 191 ASLFHILQTDHCAQTHPRADFN RRRTNEPQKLKVLLRNLRGEEDSPSHI 239 

Ml Ml II I : : I I I : : : I I : I : I : 

Db 191 ETLFQLLQNKHCPQGSNQGPNSAPAGWRWPMGS PPS FKI -QPSMRGRD- - PTHL 241 



RESULT 7 
STC_ONCKE 

ID STC_ONCKE STANDARD; PRT; 179 AA. 

AC P43647; Q91427; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE . Stanniocalcin (STC) (Corpuscles of stannius protein) (CS) (Hypocalcin) 

DE (Teleocalcin) . 

GN STC. 

OS Oncorhynchus keta (Chum salmon) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii; Salmonif ormes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8018; 

RN [1] 

RP SEQUENCE FROM N.A. , AND SEQUENCE. 

RC TISSUE=Stannius corpuscules; 

RX MEDLINE=96077816; PubMed-74 89819; 

RA Yamashita K. , Koide Y. , Itoh H., Kawada N. , Kawauchi H . ; 

RT "The complete amino acid sequence of chum salmon stanniocalcin, a 

RT calcium-regulating hormone in teleosts." ; 

RL Mol. Cell. Endocrinol. 112:159-167(1995). 

RN [2] 

RP SEQUENCE OF 1-33. 

RC TISSUE=Stannius corpuscules; 

RX MEDLINE=93055697; PubMed=1430418 ; 

RA Sundell K., Bjoernsson B.T., Itoh H., Kawauchi H.; 



RT "Chum salmon (Oncorhynchus keta) stanniocalcin inhibits in vitro 

RT intestinal calcium uptake in Atlantic cod (Gadus morhua)."; 

RL J. Comp. Physiol. B 162:489-495(1992). 

RN [3] 

RP DISULFIDE BONDS. 

RX MEDLINE=99216273; PubMed=10198206; 

RA Hulova I., Kawauchi H. ; 

RT "Assignment of disulfide linkages in chum salmon stanniocalcin."; 

RL Biochem. Biophys . Res. Commun . 257:2 95-299(1999). 

CC -!- FUNCTION: Its primary function is the prevention of hypercalcemia. 

CC Upon release into the circulation, it lowers calcium transport by 

CC the gills, thereby reducing its rate of influx from the 

CC environment into the extracellular compartment. STC also 

CC stimulates phosphate reabsorption by renal proximal tubules. The 

CC consequence of this action is increased levels of plasma 

CC phosphate, which combines with excess calcium and promotes its 

CC disposal into bone and scales. 

CC SUBUNIT: Homodimer; disulf ide-linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Produced and secreted by the corpuscles of 
CC stannius. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; S80134; AAB35648.2; 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Hormone; Glycoprotein; Calcium transport. 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 



0; Gaps 0; 

Qy 40 AEWRCLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCI 99 

: : I I I I I I I I I I I I I I I I I I I I I I I I I : I I I : I : : I I I : I I I I I I I I I I : I I 
Db 7 SDVARCLNGALDVGCGTFACLENSTCDTDGMHDICQLFFHTAATFNTQGKTFVKESLRCI 66 

Qy 100 ANGVTSKVFLAI RRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRY 159 

I II I I I I I I II I I I I I I I : I I I I I I I I : I : : I : I : I I I I I I I I I : I II III 
Db 67 AN GVT S KV FQ T I RRC GVFQ RM I S EVQ EECYSRLDIC G VAR S N P EAI G E WQ VP AH F PN R Y 126 



DISULFID 


12 


26 






DISULFID 


21 


41 






DISULFID 


32 


81 






DISULFID 


65 


95 






DISULFID 


102 


137 






DISULFID 


169 


169 


INTERCHAIN. 




CARBOHYD 


29 


29 


N-LINKED (GLCNAC. 


. .) . 


CONFLICT 


160 


160 


P -> L (IN REF. 1; 


AA SEQUENCE) 


CONFLICT 


170 


170 


S -> P (IN REF. 1; 


AA SEQUENCE) 


! SEQUENCE 


17 9 AA; 


19518 MW; 


0298CAC33E2BE445 


CRC64; 


Query Match 




48.7%; 


Score 617; DB 1; 


Length 179; 


Best Local Similarity 


66.1%; 


Pred. No. 2.9e-46; 




Matches 109; 


Conservative 2 8 


; Mismatches 28 


; Indels 



Qy 160 YNRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHILQTDHCAQ 204 

I : I : : I I I I I I :. M : : I I : : : I I : I : I : I | I I : I 
Db 12 7 YSTLLQSLLACDEETVAWRAGLVARLGPDMETPFQLLQNKHCSQ 171 



RESULT 8 
STC2_MOUSE 

ID STC2_MOUSE STANDARD; PRT; 296 AA. 

AC 088452; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 

DE Stanniocalcin 2 precursor (STC-2) . 

GN STC2 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [ 1 ] 

RP SEQUENCE FROM N.A. ' 

RX MEDLINE=98389283; PubMed=97238 90 ; 

RA Chang A.C.-M., Reddel R.R.; 

RT "Identification of a second stanniocalcin cDNA in mouse and human: 

RT stanniocalcin 2."; 

RL Mol. Cell. Endocrinol. 141:95-99(1998). 

RN [2] 

RP SEQUENCE FROM N.A., AND CHARACTERIZATION. 

RC TISSUE^Breast carcinoma; 

RX MEDLINE=99378040; PubMed=10450 831 ; 

RA Moore E.E., Kuestner R.E., Conklin D.C., Whitmore T.E., Downey W., 

RA Buddie M.M., Adams R.L., Bell L.A., Thompson D. L . , Wolf A. , Chen L., 

RA Stamni M. R. , Grant F.J., Lok S., Ren H., de Jongh K.S.; 

RT "Stanniocalcin 2: characterization of the protein and its localization 

RT to human pancreatic alpha cells."; 

RL ' Horm. Metab . Res. 31:406-414(1999). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Breast tumor; 

RX MEDLINE=22388257; PubMed=12477 932 ; 

RA Strausberg R. L . , Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R. D . , Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J . , Hsieh F. , 

RA Diatchenko L. f Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T . B . , Toshiyuki S., Carninci P., Prange C. A 

RA Raha S.S., Loquellano N.A., Peters G.J. f Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J. f McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J. , Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 



RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: Has an anti-hypocalcemic action on calcium and phosphate 
CC homeostasis. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted (Potential). 

CC -!- TISSUE SPECIFICITY: Found in a variety of tissues including 
CC skeletal muscle, small intestine, kidney, liver and brain. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation — 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF056244; AAC27507.1; -. 

DR EMBL; AF031035; AAD01921.1; 

DR EMBL; BC012206; AAH12206.1; -. 

DR MGD; MGI: 1316731; Stc2 . 

DR ' InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Hormone; Signal; Glycoprotein. 

FT SIGNAL 1 2 4 POTENTIAL. 

FT CHAIN 25 296 STANNIOCALCIN 2. 

FT CARBOHYD 73 73 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 296 AA; 32601 MW; 0C1EF4008912DE68 CRC64; 



Query Match 28.0%; Score 354.5; DB 1; Length 296; 

Best Local Similarity 31.7%; Pred. No. 2.2e-23; 

Matches 85; Conservative 45; Mismatches 109; Indels 2 9; Gaps 5; 

Qy 7 VLLVLVI SASATHEAEQNDSVSP RK S RVAAQN S AE WRC LN S ALQ VGC GA 56 

I I I I : I : I I : I : I I :: I I : I I : I I : I I I I I 

Db 10 VTLALVF— ATLDPAQGTDSTNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGV 67 

Qy 57 FACLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCST 116 

I I I I : : I : I : : I I : I I : : I I I I I I I : I : I : : I : I I : : I I : I 

Db 68 FECFENNSCEIQGLHGICMTFLHNAGKFDAQGKSFIKDALRCKAHALRHKFGCISRKCPA 127 

Qy 117 FQRMIAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVS 176 

: I : : : I I I I I : : II I : I I I : : s : I II Mill 
Db 128 IREMVFQLQRECYLKHDLCSAAQENVGVIVEMIHFKDLLLHEPYVDLVNLLLTCGEDVKE 187 

Qy 177 TIRDSLMEKIGPNMASLFHIL QTDHCAQTH PRADFNRRRTNEPQKLKV 22 4 

: I : : : I I I : II I I : I : 

Db 188 AVTRSVQAQCEQSWGGLCSILSFCTSNIQRPPTAAPEHQPLADRAQLSRPHHRDTDHHLT 247 

Qy 225 LLRNLRGEEDSPSHIK RTSHESA 247 

I : I I I I I I I : I I 

Db 24 8 ANRGAKGERGSKSHPNAHARGRTGGQSA 275 



RESULT 9 



STC2_HUMAN 

ID STC2_HUMAN STANDARD; PRT; 302 AA. 

AC 076061; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, .Last annotation update) 

DE Stanniocalcin 2 precursor (STC-2) (Stanniocalcin-related protein) 

DE (STCRP) (STC-related protein) . 

GN STC2. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=98389283; PubMed=9723890; 

RA Chang A.C.-M., Reddel R.R.; 

RT - "Identification of a second stanniocalcin cDNA in mouse and human: 

RT stanniocalcin 2." ; 

RL Mol. Cell. Endocrinol. 141:95-99(1998). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Osteosarcoma; 

RX MEDLINE=98440784; PubMed=9753616 ; 

RA Ishiabshi K. , Miyamoto K. , Taketani Y., Morita K., Takeda E., 

RA Sasaki S., Imai M. ; 

RT "Molecular cloning of a second human stanniocalcin homologue (STC2)."; 

RL Biochem. Biophys . Res. Commun. 250:252-258(1998). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=99145369; PubMed=1002277 1 ; 

RA DiMattia G. E. , Varghese R., Wagner G.F.; 

RT "Molecular cloning and characterization of stanniocalcin-related 

RT protein."; . " 

RL Mol. Cell. Endocrinol. 146:137-140(1998). 

RN [4] 

RP SEQUENCE FROM N.A., AND CHARACTERIZATION. 

RC TISSUE=Breast carcinoma; 

RX MEDLINE=99378040; PubMed=10450831 ; 

RA Moore E.E., Kuestner R.E., Conklin D.C., Whitmore T.E., Downey W. , 

RA Buddie M.M., Adams R. L . , Bell L.A., Thompson D.L., Wolf A., Chen L., 

RA Stamm M. R. , Grant F.J. / Lok S., Ren H., de Jongh K.S.; 

RT "Stanniocalcin 2: characterization of the protein and its localization 

RT to human pancreatic alpha cells."; 

RL Horm. Metab. Res. 31:406-414(1999). 

RN [5] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Kidney, and Muscle; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G. D. , 

RA Altschul S.F., Zeeberg B . f Buetow K.H., Schaefef C.F., Bhat N . K . , 

RA Hopkins R.F. f Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L. r Marusina K., Farmer A. A. , Rubin G.M., Hong L . r 

RA Stapleton M. f Soares M. B . , Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N . A. f Peters G.J. f Abramson R. D . , Mullahy S.J. f 



RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P . H . , 

RA Richards S.,-Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk-S.W., 

RA Villaion D.K., Muzny D.M. , Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A. , Young A.C., Shevchenko Y . , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: Has an anti-hypocalcemic action on calcium and phosphate 
CC homeostasis . 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted (Potential). 

CC -!- TISSUE SPECIFICITY: Expressed in a variety of tissues including 
CC muscle, heart, pancreas, kidney, spleen, prostate, small 

CC intestine, colon and peripheral blood Leucocytes. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC ^ 

DR EMBL; AF055460; AAC27036.1; -. 

DR EMBL; AB012664; BAA33489.1; -. 

DR EMBL; AF098462; AAC97948.1; -. 

DR EMBL; AF031036; AAD01922.1; 

DR EMBL; BC000658; AAH00658.1; -. 

DR EMBL; BC006352; AAH06352.1; -. 

DR EMBL; BC013958; AAH13958.1; -. 

DR PIR; JE0357; JE0357. 

DR Genew; HGNC: 11374; STC2 . 

DR MIM; 603665; -. 

DR GO; GO: 0005181; F: glycopeptide hormone; TAS . 

DR GO; GO: 0005180; F:peptide hormone; TAS. 

DR GO; GO:0007166; P:cell surface receptor linked signal transdu. . .; TAS. 

DR GO; GO: 0007267; P: cell-cell signaling; TAS. 

DR GO; GO: 0007584; P: response to nutrients; TAS. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Hormone; Signal; Glycoprotein. 

FT SIGNAL 1 24 POTENTIAL . 

FT CHAIN 25 302 STANNIOCALCIN 2. 

FT CARBOHYD 73 73 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 302 AA; 33248 MW; 9B90D8911524FA22 CRC64; 

Query Match 27.9%; Score 354; DB 1; Length 302; 

Best Local Similarity 32.4%; Pred. No. 2.5e-23; 

Matches 83; Conservative 45; Mismatches 102; Indels 26; Gaps 4; 
Qy 9 LVLVIS ASATHEAE-QNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 



Ill:: II: I I I :| |:: ||:||: II :| MM I I 

Db 12 LALVLATFDPARGTDATNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECF 71 

Qy 61 ENSTCDTDGMYDI CKS FLYSAAKFDTQGKAFVKESLKCIANGWSKVFLAI RRCST FQRM 12 0 

I I : : I : I : : M : M : : I I I I M I : I : I : : I I I I : : : I : I : I 

Db 72 ENNSCEIRGLHGICMTFLHNAGKFDAQGKSFIKDALKCKAHALRHRFGCISRKCPAIREM 131 

Qy 121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

: : : : I M I I : : I : I : I I I : : : : I II Mil: I 
Db 132 VSQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHEPYVDLVNLLLTCGEEVKEAITH 191 

Qy 181 SLMEKIGPNMASLFHILQ-TDHCAQTHPRADFNRRRTNEPQKL 222 

I: : I M M I I I |: : M 

Db 192 SVQVQCEQNWGSLCSILSFCTSAIQKPPTAPPERQPQVDRTKLSRAHHGEAGHHLPEPSS 251 

Qy 223 KVLLRNLRGEEDSPSH 238 

: I : I I III 
Db 252 RETGRGAKGERGSKSH 267 



RESULT 10 



STC2_MACNE 

ID STC2_MACNE STANDARD; PRT; 302 AA. 

AC 097561; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Stanniocalcin 2 precursor (STC-2) . 

GN STC2 . 

OS Macaca neraestrina (Pig-tailed macaque) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9545; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE^Pancreatic islets; • 

RX MEDLINE=99378040; PubMed-10450831 ; 

RA Moore E.E., Kuestner R.E., Conklin D.C., Whitmore T.E., Downey W. , 

RA Buddie M.M. , Adams R.L., Bell L.A., Thompson D.L., Wolf A., Chen L., 

RA StammM.R., Grant F.J., Lok S., Ren H., de Jongh K.S.; 

RT "Stanniocalcin 2: characterization of the protein and its localization 

RT to human pancreatic alpha cells."; 

RL Horm. Metab . Res. 31:406-414(1999). 

CC -!- FUNCTION: Has an anti-hypocalcemic action on calcium and phosphate 
CC homeostasis. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted (Potential). 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC ■ 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



cc 

DR EMBL;- AF035377; AAD02027.1; -. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Hormone; Signal; Glycoprotein. 

FT SIGNAL 1 24 POTENTIAL. 

FT CHAIN 25 302 STANNIOCALCIN 2. 

FT CARBOHYD 73 73 N-LINKED ( GLCNAC . . .) (POTENTIAL) 

SQ SEQUENCE 302 AA; 33281 MW; 62953CE958AF64C1 CRC64; 



Query Match 27.8%; Score 352; DB 1; Length 302; 

Best Local Similarity 32.8%; Pred. No. 3.7e-23; 

Matches 84; Conservative 44; Mismatches 102; Indels 26; 



Gaps 



5; 



Qy 

Db 



9 LVLVIS AS ATH EAE- QND S VS P RK S RVAAQN S AE WRC LN S ALQVGCGAFAC L 60 

III:: II: I I I : I I : : I I : I I : I I : I I II I I I 

12 LALVLATIDP7VRGTDATNPPEGPQDRSSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECF 71 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



61 ENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCSTFQRM 120 

I I : : I : I : : I I : I I : : I I I I I I I : I : I : : I I I I : : : I : I : I 

72 ENNSCEIRGLHGICMTFLHNAGKFDAQGKSFIKDALKCKAHALRHRFGCISRKCPAIREM 131 

121 IAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRD 180 

: : : I I I I I : : I : I : I I I : : : : I II MM: I 

132 VFQLQRECYLKHDLCAAAQENTRVIVEMIHFKDLLLHEPYVDLVNLLLTCGEEVKEAITH 191 

181 SLMEKIGPNMASLFHILQTDHCAQTHP RAD FN RRRTN E PQ-KL 222 

I : : I I I I I I I .11:1 I • 

192 SVQVQCEQNWGSLCSILSFCTSAIQRPPTAPPERQPQVDRAKLSRAHHGEAGHHLPEPSS 251 

223 KVLLRNLRGEEDSPSH 238 

: I : I I III 
252 RETGRGAKGERGSKSH 2 67 



RESULT 11 
STC2_RAT 

ID STC2_RAT STANDARD ; PRT; 296 AA. 

AC Q9R0K8; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Stanniocalcin 2 precursor (STC-2). 

GN STC2 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; 

RX MEDLINE=99439722; PubMed=10508929 ; 

RA Honda S., Kashiwagi M. , Ookata K., Tojo A. 

RT "Regulation by lalpha, 25-dihydroxyvitamin 

RT stanniocalcin messages in the rat kidney and ovary."; 

RL FEBS Lett. 459:119-122(1999). 

CC -!- FUNCTION: Has an anti-hypocalcemic action on calcium and phosphate 



Hirose S . ; 
D(3) of expression of 



CC homeostasis. 

CC -!- SUBUNIT: Homodimer ; disulfide- linked (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted (Potential). 

CC -!- TISSUE SPECIFICITY: Expressed in a variety of tissues. Strongly 
CC expressed in ovary and to a lesser extent in kidney. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC ^ 

DR EMBL; AB030707; BAA85251.1; -. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Hormone; Signal; Glycoprotein. 

FT SIGNAL 1 24 POTENTIAL. 

FT CHAIN 25 296 STANNIOCALCIN 2. 

FT CARBOHYD 73 73 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 296 AA; 32621 MW; 0903639858D9B2DC CRC64; 



Query Match 27.5%; Score 349; DB 1; Length 296; 

Best Local Similarity 31.8%; Pred. No. 6.6e-23; 

Matches 81; Conservative 44; Mismatches 106; Indels 24; Gaps 4; 

Qy 7 VLLVLVISASATHEAEQNDSVSP RKS RVAAQNS AEWRCLN S ALQVGCGA 56 

I I I I : I I I : I : | | : : | | : | | : | | : | | | M 

Db 10 VTLALVF — ATLDPARGTDSTNPPEGPQDRGSQQKGRLSLQNTAEIQHCLVNAGDVGCGV 67 

Qy 57 FACLENSTCDTDGMYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAIRRCST 116 

I I I I : : I : I : : I I : I I : : I I I I I I I : I : I : : I : I I : . : I hi 
Db 68 FECFENNSCEI QGLHGI CMT FLHNAGKFDAQGKS FI KDALRCKAHALRHKFGCI SRKCPA 127 

Qy 117 FQRMIAEVQEECYSKLNVCSIAKRNPEAITEWQLPNHFSNRYYNRLVRSLLECDEDTVS 17 6 

: I : :: I I I I I : : I i I : I I I : : : : I II I I I I I 

Db 128 IREMVYQLQRECYLKHDLCSAAQENVWIVEMIHFKDLLLHEPYVDLVNLLLTCGEDVRE 187 

Qy 177 TIRDSLMEKIGPNMASLFHIL QTDHCAQTH— — PRADFNRRRTNEPQKLKV 224 

: I : : : I I I : II II : I : 

Db 188 AWRSVQAQCEQSWGGLCSILSFCTSNIQRPPTAAPEHQPLADRAQLSRPYHRDTDHHLT 247 

Qy 225 LLRNLRGEEDSPSHI 239 

I : I I I I I : 
Db 248 ANRGAKGERGSKSHL 2 62 



RESULT 12 
STC2_CAVP0 

ID STC2_CAVPO STANDARD; PRT; 197 AA. 

AC P57675; 

DT 16-OCT-2001 (Rel. 40, Created) 
DT 16-OCT-2001 (Rel. 40, Last sequence update) 
DT 16-OCT-2001 (Rel. 40, Last annotation update) 
DE Stanniocalcin 2 (STC-2) (Fragments). 



GN STC2 . 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi; 

OC Mammalia; Eutheria; Rodentia; Hys tricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=99378040; PubMed=10450831 ; 

RA Moore E.E., Kuestner R.E., Conklin D.C. f Whitmore T.E., Downey W., 

RA Buddie M.M. , Adams R.L., Bell L. A. , Thompson D.L., Wolf A., Chen L., 

RA Stanim M.R., Grant F.J., Lok S., Ren H., de Jongh K.S.; 

RT "Stanniocalcin 2: characterization of the protein and its localization 

RT to human pancreatic alpha cells."; 

RL Horm. Metab. Res. 31:406-414(1999). 

CC -!- FUNCTION: Has an anti-hypocalcemic action on calcium and phosphate 
CC homeostasis. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked (By similarity). 
CC SUBCELLULAR LOCATION: Secreted (Potential). 

CC 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



Best Local Similarity 25.8%; Pred. No. l.le-08; 

Matches 58; Conservative 27; Mismatches 85; Indels 55; Gaps 6; 

Qy 25 DSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDTDGMYDICKSFLYSAAKF 84 

I | : | | : : | | : | | : II ♦ I I I I I I I I I : I : : I 
Db 12 DRGSQQKGRLSLQNTAEIQHCLVNAGDVGCGVFECFENNXCXIXXLHXI 60 

Qy 85 DTQGKAFVKESLKCIANGWSKVFLAIRRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEA 144 

:|:| 1:1 : I : : : I I I I I : : I I I 
Db 61 SFIKAH ALRFGCISREMVFQLQRECYLKHDLCMINFR 97 

Qy 145 ITEWQLPNHFSNRYYNRLVRSLLECDEDTVSTIRDSLMEKIGPNMASLFHIL QT 199 

: : I II II I II : | : : | II 
Db 98 DLLLHEPYVDLVNLLLTCGEDVKEAVTRSIQAQCEQNWGGLCSILSFCTSNV 149 

Qy 2 00 DHCAQTHPRADFNRRRTNEP QKLKVLLRNLRGEEDSPSH 238 

I I I I : : I I : : I I I I I 

Db 150 QRPXAXQPXAD — RAQVSRPHHHDTGHHLLEAIXGAKGERGSKSH 192 



-!- SIMILARITY: 


Belongs 


to the stanniocalcin family. 


r Hormone. 








NON CONS 


60 


61 




NON_CONS 


64 


65 




NON_CONS 


69 


70 




NON_CONS 


75 


76 




NON_CONS 


92 


93 




NON CONS 


154 


155 




SEQUENCE 


197 AA; 218 


99 MW; 1512DA859C8A8E67 CRC64 ; 


Query Match 




14 


.3%; Score 181.5; DB 1; Length 



RESULT 13 
STC_ONCNE 

ID STC_ONCNE STANDARD; PRT; 4 0 AA. 

AC P43649; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Stanniocalcin (STC) (Corpuscles of stannius protein) (CS). (Hypocalcin) 



DE (Teleocalcin) (Fragment) . 

GN STC. 

OS Oncorhynchus nerka (Sockeye salmon) . 

OC Eukaryota; Metazoa; Chorclata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii; Salmoni formes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8023; 

RN [1] 

RP SEQUENCE. 

RX MEDLINE=89065334; PubMed=3197 944 ; 

RA Wagner G.F., Fenwick J.C., Park CM. , Milliken C, Copp D.H., 

RA Friesen H.G. ; 

RT "Comparative biochemistry and physiology of teleocalcin from sockeye 

RT and coho salmon."; 

RL Gen. Comp. Endocrinol. 72:237-246(1988). 

CC -!- FUNCTION: Its primary function is the prevention of hypercalcemia 

CC Upon release into the circulation, it lowers calcium transport by 

CC the gills, thereby reducing its rate of influx from the 

CC environment into the extracellular compartment. STC also 

CC stimulates phosphate reabsorption by renal proximal tubules. The 

CC consequence of this action is increased levels of plasma 

CC phosphate, which combines with excess calcium and promotes its 

CC disposal into bone and scales. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: Produced and secreted by the corpuscles of 
CC stannius. 

CC -!- SIMILARITY: Belongs to the stanniocalcin family. 

DR PIR; A60841; A60841. 

DR InterPro; IPR004978; Stanniocalcin. 

DR Pfam; PF03298; Stanniocalcin; 1. 

KW Hormone; Glycoprotein; Calcium transport. 

FT CARBOHYD 29 29 N-LINKED (GLCNAC. . .). 

FT UNSURE 29 29 

FT NON_TER 40 4 0 

SQ SEQUENCE 40 AA; 4134 MW; 0C517B7BBBC5EE6C CRC64; 



Query Match 12.3%; Score 156; DB 1; Length 40; 

Best Local Similarity 79.4%; Pred. No. 2.5e-07; 

Matches 27; Conservative 3; Mismatches 4; Indels 



0; Gaps 



Qy 

Db 



40 AEWRC LN S ALQ VG C GAFAC LENSTCDTD GM Y D I 73 
: : I I I II II I I II I I I I I I I I I I I I I I : I I 
7 S D VARC LN GAL D VG C GT FAC LENSTCDTD GMH D I 4 0 



PRT; 1105 AA. 



RESULT 14 
A3B1_M0USE 

ID A3B1_M0USE STANDARD; 
AC Q9Z1T1; 

DT. 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Adapter-related protein complex 3 beta 1 subunit (Beta-adaptin 3A) 
DE (AP-3 complex beta-3A subunit) (Beta-3A~adaptin) . 
GN AP3B1. 

OS Mus musculus (Mouse) . 



OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;- Mus . 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C3H/HeJ; 

RX MEDLINE=99135912; PubMed=993 134 0 ; 

RA Feng L., Seymour A.B., Jiang S.Y., To A. , Peden A. A. , Novak E.K., 

RA Zhen L., Rusiniak M.E., Eicher E.M., Robinson M.S., Gorin M.B., 

RA Swank R.T. ; 

RT "The beta-3A subunit gene (Ap3bl) of the AP-3 adaptor complex is 

RT altered in the mouse hypopigmentation mutant pearl, a model for 

RT Hermansky-Pudlak syndrome and night blindness."; 

RL Hum. Mol. Genet. 8:323-330(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN-129/Sv; 

RX MEDLINE=20512043; PubMed=11056055 ; 

RA Feng L., Rigatti B.W., Novak E.K., Gorin M.B., Swank R.T.; 

RT "Genomic structure of the mouse ap3bl gene in normal and pearl mice."; 

RL Genomics 69:370-379(2000). 

CC -!- FUNCTION: Part of the AP-3 complex/ an adaptor-related complex 

CC which is not clathrin-associated. The complex is associated with 

CC the Golgi region as well as more peripheral structures. It 

CC facilitates the budding of vesicles from the Golgi membrane and 

CC may be directly involved in trafficking to lysosomes. 

CC -!- SUBUNIT: Assembly protein complex 3 (AP-3) is a heterotetramer 

CC composed of two large chains (delta and beta3) , a medium chain 

CC (mu3) and a small chain (sigma3) . 

CC -!- PTM: Phosphorylated on serine residues (By similarity). 

CC -!- DISEASE: Defects in AP3B1 are the cause of the autosomal recessive 

CC phenotype 1 pearl 1 (pe) . Pearl mice exhibit hypopigmentation, 

CC lysosomal secretion abnormalities, and platelet-dense granules 

CC with reduced levels of adenine nucleotides and serotonin. The 

CC changes in platelets lead to prolonged bleeding. Additionally, 

CC pearl mice exhibit reduced sensitivity in the dark-adapted state. 

CC -!- SIMILARITY: Belongs to the adaptor complexes large subunit 

CC family. 

CC r 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC ■ 

DR EMBL; AF103809; AAC78338.1; -. 

DR EMBL; AF255589; AAG23622.1; -. 

DR EMBL; AF255566; AAG23622.1; JOINED. 

DR EMBL; AF255567; AAG23622.1; JOINED. 

DR EMBL; AF255568; AAG23622.1; JOINED. 

DR EMBL; AF255569; AAG23622.1; JOINED. 

DR EMBL; AF255570; AAG23622.1; JOINED. 

DR EMBL; AF2 55571; AAG23622.1; JOINED. 

DR EMBL; AF255572; AAG23622.1; JOINED. 

DR EMBL; AF255573; AAG23622.1; JOINED. 



DR 


EMBL, 


AF255574; 


AAG23622 . 


1 


; JOINED. 


DR 


EMBL, 


AF255575; 


AAG2 3622 . 


1, 


? JOINED. 


DR 


EMBL, 


AF255576; 


AAG23622 . 


1, 


r JOINED. 


DR 


EMBL, 


AF255577; 


AAG23622 . 


1, 


; JOINED. 


DR 


EMBL, 


AF255578; 


AAG23622 . 


1, 


• JOINED. 


DR 


EMBL, 


AF255579; 


AAG23622 . 


1, 


; JOINED. 


DR 


EMBL, 


AF255580; 


AAG23622 . 


1/ 


' JOINED . 


DR 


EMBL, 


AF255581; 


AAG23622 . 


1, 


• JOINED . 


DR 


EMBL, 


AF255582; 


AAG23622 . 


1, 


• JOINED 


DR 


EMBL, 


AF255583; 


AAG23622 . 


1 / 


' JOINED. 


DR 


EMBL, 


AF255584; 


AAG23622. 


1 , 


• JOINED. 


DR 


EMBL, 


AF255585; 


AAG23622. 


1, 


' JOINED . 


DR 


EMBL, 


AF255586; 


AAG23622. 


1. 


• JOINED. 


DR 


EMBL, 


AF255587; 


AAG23622. 


1, 


• JOINED. 


DR 


EMBL, 


AF255588; 


AAG23622. 


1, 


• JOINED. 


DR 


PIR; 


T18295; T18295. 






DR 


MGD; 


MGI: 1333879; Ap3bl. 







DR InterPro; IPR002553; Adapting. 

DR InterPro; IPR008938; ARM.. 

DR Pfam; PF01602; Adapt in_N; 1. 

KW Golgi stack; Protein transport; Transport; Phosphorylation. 

FT DOMAIN 678 802 GLU/SER-RICH . 

SQ SEQUENCE 1105 AA; 122869 MW; 586B818CE4FB5AEE CRC64; 

Query Match 7.4%; Score 94; DB 1; Length 1105; 

Best Local Similarity 21.2%; Pred. No. 3.1; 

Matches 55; Conservative 40; Mismatches 104; Indels 60; Gaps 11; 

Qy 1 MLQNSAVLLVXVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACL 60 

: I I : : I : : : I : I I : I I : I I I I : I : 

Db 308 L LQ S RNAAWMAVAQ L YWH ISPKSE — AGVISKSLVRLLRSNREVQYIVLQNI 358 

Qy 61 ENSTCDTDGMYD-ICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFL 109 

: : I I : :. I I I . I : II I I : I I I 

Db 35 9 ATMS I ERKGMFEP YLKS F- YVRSTDPTMI KTLKLEI LTNLANEANI STLLREFQTYVRSQ 417 

Qy 110 AIRRCSTFQRMIAEVQEECYSKLNVCSIAKRNPEAITE WQLPNHFS 156 

I I I : I I : I | | : : I I I : : I : : I I : : 

Db 418 D KQ FAAAT I QT I GRCAT SISEFTETCFNGL-VCLLSNRDEIWAESWVIKKLLQMQ 473 

Qy 157 NRYYNRLVRSLLE-CDEDTVSTIRDSLMEKIGPNMASLFHILQTDHCAQTHPRADFNRRR 215 

: :: I : : I I I I I :: I I I I : I 
Db 474 PAQHGEIIRHMAKFLDSITVPVARASILWLIGEN ' CERVPKIA 515 

Qy 216 TNEPQKLKVLLRNLRGEED 234 

I I : : :: I : I 
Db 516 PDVLRKMAKSFTSEDD 531 



RESULT 15 
SNAA_VITVI 

ID SNAA_VITVI STANDARD; PRT; 289 AA. 

AC P93798; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Alpha-soluble NSF attachment protein (Alpha-SNAP) (N-ethylmaleimide- 



DE sensitive factor attachment protein, alpha) . 

OS Vitis vinifera (Grape) . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta ; Tracheophyta; 

OC Spermatophyta; Magnoliophyta ; eudicotyledons ; core eudicots; rosids; 

OC Vitaceae; Vitis. 

OX NCBI_TaxID=29760; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Shiraz; TISSUE=Fruit ; 

RX MEDLINE=98182598; PubMed=9522131 ; 

RA Matsumoto S., Dry I.B., Thomas M. ; 

RT ."Nucleotide sequence of grapevine (Vitis vinifera) cDNA similar to 

RT SNAP proteins."; 

RL DNA Seq. 8:109-112(1997). 

CC -!- FUNCTION: Required for vesicular transport between the endoplasmic 
CC reticulum and the Golgi apparatus (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Cytoplasmic peripheral membrane protein (By 
CC similarity) . 

CC -!- SIMILARITY: Belongs to the SNAP family. 

CC ■ 

CC This. SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AB001375; BAA19246.1; -. 

DR- PIR; T50776; T50776. 

DR Inter-Pro; IPR000744; NSF_attach. 

DR InterPro; IPR008941; TPR-like. 

DR InterPro; IPR001440; TPR. 

DR Pfam; PF02071; NSF; 4. 

DR PRINTS; PR00448; NSFATTACHMNT . 

KW Transport; Protein transport; Endoplasmic reticulum; Golgi stack. 

SQ SEQUENCE 289 AA; 32485 MW; 939E49A56C1388D4 CRC64; 

Query Match 7.1%; Score 90.5; DB 1; Length 289; 

Best Local Similarity 19.9%; Pred. No. 1.2; 

Matches 41; Conservative 33; Mismatches 95; Indels 37; Gaps 5; 

Qy 8 LLVLVISASATHEAEQNDSVSPRKSRVAAQNSAEWRCLNSALQVGCGAFACLENSTCDT 67 

I : : I : : I I I I : : : : | : || I I : I : I : 

Db 59 LSTVIQSSDSKHEAAQ — AYADAGHC YKKT S AKEAI S CLEQA AYLFLDNGRFNM 110 

Qy 68 DG MYDICKSFLYSAAKFDTQGKAFVKESLKCIANGVTSKVFLAI RRCSTFQR 119 

I : | : : : : | : | : : | | | : | | : : | : 

Db 111 AGKYYKEIAELYELEQNFEQAIIYFEKAADIYQSEEATTAANQCNAKVAQFAAQLEQYQK 17 0 

Qy 120 MI AEVQEEC YS KLN VCSIAKRNPEAITEWQL PNHFSNR 158 

I :: II : | : | : | | | : | | 

Db 171 AIQIYEDIGRPSLNNNLLKYGVKGHLLNAGICQLCKGDWAITNALDRYQEMDPTFSGTR 230 

Qy 159 YYNRLVRS LLECDEDTVSTI RDS LME 184 

I II I I : I I : : I 

Db 231 EYKLLVDLAAAVDEEDWKFTDAVKE 256 



Search completed: July 19, 2004, 15:32:39 
Job time : 15 sees 



